hello
hello

📌S Retain class distribution for seed 1:
Class 0: 4500
Class 1: 4500
Class 2: 4500
Class 3: 4500
Class 4: 4500
Class 5: 4500
Class 6: 4500
Class 7: 4500
Class 8: 4500
Class 9: 4500

📌S Forget class distribution for seed 1:
Class 0: 500
Class 1: 500
Class 2: 500
Class 3: 500
Class 4: 500
Class 5: 500
Class 6: 500
Class 7: 500
Class 8: 500
Class 9: 500
hello
hello
⚠️ Warning: Retain train loader may not be shuffled.
Training Epoch: 1 [256/45000]	Loss: 2.5034	LR: 0.000000
Training Epoch: 1 [512/45000]	Loss: 2.5235	LR: 0.000568
Training Epoch: 1 [768/45000]	Loss: 2.4993	LR: 0.001136
Training Epoch: 1 [1024/45000]	Loss: 2.4938	LR: 0.001705
Training Epoch: 1 [1280/45000]	Loss: 2.3769	LR: 0.002273
Training Epoch: 1 [1536/45000]	Loss: 2.3239	LR: 0.002841
Training Epoch: 1 [1792/45000]	Loss: 2.1899	LR: 0.003409
Training Epoch: 1 [2048/45000]	Loss: 1.9639	LR: 0.003977
Training Epoch: 1 [2304/45000]	Loss: 1.7747	LR: 0.004545
Training Epoch: 1 [2560/45000]	Loss: 1.6085	LR: 0.005114
Training Epoch: 1 [2816/45000]	Loss: 1.3375	LR: 0.005682
Training Epoch: 1 [3072/45000]	Loss: 1.1036	LR: 0.006250
Training Epoch: 1 [3328/45000]	Loss: 0.9602	LR: 0.006818
Training Epoch: 1 [3584/45000]	Loss: 0.7371	LR: 0.007386
Training Epoch: 1 [3840/45000]	Loss: 0.6021	LR: 0.007955
Training Epoch: 1 [4096/45000]	Loss: 0.4909	LR: 0.008523
Training Epoch: 1 [4352/45000]	Loss: 0.3513	LR: 0.009091
Training Epoch: 1 [4608/45000]	Loss: 0.3706	LR: 0.009659
Training Epoch: 1 [4864/45000]	Loss: 0.2932	LR: 0.010227
Training Epoch: 1 [5120/45000]	Loss: 0.2993	LR: 0.010795
Training Epoch: 1 [5376/45000]	Loss: 0.2284	LR: 0.011364
Training Epoch: 1 [5632/45000]	Loss: 0.2934	LR: 0.011932
Training Epoch: 1 [5888/45000]	Loss: 0.1791	LR: 0.012500
Training Epoch: 1 [6144/45000]	Loss: 0.2166	LR: 0.013068
Training Epoch: 1 [6400/45000]	Loss: 0.2223	LR: 0.013636
Training Epoch: 1 [6656/45000]	Loss: 0.2513	LR: 0.014205
Training Epoch: 1 [6912/45000]	Loss: 0.2388	LR: 0.014773
Training Epoch: 1 [7168/45000]	Loss: 0.1818	LR: 0.015341
Training Epoch: 1 [7424/45000]	Loss: 0.1510	LR: 0.015909
Training Epoch: 1 [7680/45000]	Loss: 0.3198	LR: 0.016477
Training Epoch: 1 [7936/45000]	Loss: 0.3157	LR: 0.017045
Training Epoch: 1 [8192/45000]	Loss: 0.1551	LR: 0.017614
Training Epoch: 1 [8448/45000]	Loss: 0.2891	LR: 0.018182
Training Epoch: 1 [8704/45000]	Loss: 0.2011	LR: 0.018750
Training Epoch: 1 [8960/45000]	Loss: 0.2515	LR: 0.019318
Training Epoch: 1 [9216/45000]	Loss: 0.2200	LR: 0.019886
Training Epoch: 1 [9472/45000]	Loss: 0.2910	LR: 0.020455
Training Epoch: 1 [9728/45000]	Loss: 0.3263	LR: 0.021023
Training Epoch: 1 [9984/45000]	Loss: 0.2319	LR: 0.021591
Training Epoch: 1 [10240/45000]	Loss: 0.2604	LR: 0.022159
Training Epoch: 1 [10496/45000]	Loss: 0.2201	LR: 0.022727
Training Epoch: 1 [10752/45000]	Loss: 0.2428	LR: 0.023295
Training Epoch: 1 [11008/45000]	Loss: 0.2275	LR: 0.023864
Training Epoch: 1 [11264/45000]	Loss: 0.2310	LR: 0.024432
Training Epoch: 1 [11520/45000]	Loss: 0.2033	LR: 0.025000
Training Epoch: 1 [11776/45000]	Loss: 0.2938	LR: 0.025568
Training Epoch: 1 [12032/45000]	Loss: 0.1936	LR: 0.026136
Training Epoch: 1 [12288/45000]	Loss: 0.2794	LR: 0.026705
Training Epoch: 1 [12544/45000]	Loss: 0.2220	LR: 0.027273
Training Epoch: 1 [12800/45000]	Loss: 0.2043	LR: 0.027841
Training Epoch: 1 [13056/45000]	Loss: 0.2373	LR: 0.028409
Training Epoch: 1 [13312/45000]	Loss: 0.2479	LR: 0.028977
Training Epoch: 1 [13568/45000]	Loss: 0.3464	LR: 0.029545
Training Epoch: 1 [13824/45000]	Loss: 0.1846	LR: 0.030114
Training Epoch: 1 [14080/45000]	Loss: 0.2335	LR: 0.030682
Training Epoch: 1 [14336/45000]	Loss: 0.3099	LR: 0.031250
Training Epoch: 1 [14592/45000]	Loss: 0.3220	LR: 0.031818
Training Epoch: 1 [14848/45000]	Loss: 0.2626	LR: 0.032386
Training Epoch: 1 [15104/45000]	Loss: 0.2250	LR: 0.032955
Training Epoch: 1 [15360/45000]	Loss: 0.3077	LR: 0.033523
Training Epoch: 1 [15616/45000]	Loss: 0.4470	LR: 0.034091
Training Epoch: 1 [15872/45000]	Loss: 0.3024	LR: 0.034659
Training Epoch: 1 [16128/45000]	Loss: 0.4094	LR: 0.035227
Training Epoch: 1 [16384/45000]	Loss: 0.3837	LR: 0.035795
Training Epoch: 1 [16640/45000]	Loss: 0.3902	LR: 0.036364
Training Epoch: 1 [16896/45000]	Loss: 0.4742	LR: 0.036932
Training Epoch: 1 [17152/45000]	Loss: 0.3065	LR: 0.037500
Training Epoch: 1 [17408/45000]	Loss: 0.2709	LR: 0.038068
Training Epoch: 1 [17664/45000]	Loss: 0.2662	LR: 0.038636
Training Epoch: 1 [17920/45000]	Loss: 0.2204	LR: 0.039205
Training Epoch: 1 [18176/45000]	Loss: 0.1332	LR: 0.039773
Training Epoch: 1 [18432/45000]	Loss: 0.2652	LR: 0.040341
Training Epoch: 1 [18688/45000]	Loss: 0.3625	LR: 0.040909
Training Epoch: 1 [18944/45000]	Loss: 0.2949	LR: 0.041477
Training Epoch: 1 [19200/45000]	Loss: 0.3254	LR: 0.042045
Training Epoch: 1 [19456/45000]	Loss: 0.1896	LR: 0.042614
Training Epoch: 1 [19712/45000]	Loss: 0.2851	LR: 0.043182
Training Epoch: 1 [19968/45000]	Loss: 0.2098	LR: 0.043750
Training Epoch: 1 [20224/45000]	Loss: 0.3457	LR: 0.044318
Training Epoch: 1 [20480/45000]	Loss: 0.2423	LR: 0.044886
Training Epoch: 1 [20736/45000]	Loss: 0.1780	LR: 0.045455
Training Epoch: 1 [20992/45000]	Loss: 0.2508	LR: 0.046023
Training Epoch: 1 [21248/45000]	Loss: 0.2161	LR: 0.046591
Training Epoch: 1 [21504/45000]	Loss: 0.1928	LR: 0.047159
Training Epoch: 1 [21760/45000]	Loss: 0.3261	LR: 0.047727
Training Epoch: 1 [22016/45000]	Loss: 0.2653	LR: 0.048295
Training Epoch: 1 [22272/45000]	Loss: 0.1945	LR: 0.048864
Training Epoch: 1 [22528/45000]	Loss: 0.2221	LR: 0.049432
Training Epoch: 1 [22784/45000]	Loss: 0.2424	LR: 0.050000
Training Epoch: 1 [23040/45000]	Loss: 0.2239	LR: 0.050568
Training Epoch: 1 [23296/45000]	Loss: 0.2517	LR: 0.051136
Training Epoch: 1 [23552/45000]	Loss: 0.2806	LR: 0.051705
Training Epoch: 1 [23808/45000]	Loss: 0.2047	LR: 0.052273
Training Epoch: 1 [24064/45000]	Loss: 0.3205	LR: 0.052841
Training Epoch: 1 [24320/45000]	Loss: 0.2390	LR: 0.053409
Training Epoch: 1 [24576/45000]	Loss: 0.2660	LR: 0.053977
Training Epoch: 1 [24832/45000]	Loss: 0.2495	LR: 0.054545
Training Epoch: 1 [25088/45000]	Loss: 0.2296	LR: 0.055114
Training Epoch: 1 [25344/45000]	Loss: 0.1345	LR: 0.055682
Training Epoch: 1 [25600/45000]	Loss: 0.3124	LR: 0.056250
Training Epoch: 1 [25856/45000]	Loss: 0.2109	LR: 0.056818
Training Epoch: 1 [26112/45000]	Loss: 0.2893	LR: 0.057386
Training Epoch: 1 [26368/45000]	Loss: 0.1788	LR: 0.057955
Training Epoch: 1 [26624/45000]	Loss: 0.2520	LR: 0.058523
Training Epoch: 1 [26880/45000]	Loss: 0.2451	LR: 0.059091
Training Epoch: 1 [27136/45000]	Loss: 0.2311	LR: 0.059659
Training Epoch: 1 [27392/45000]	Loss: 0.2185	LR: 0.060227
Training Epoch: 1 [27648/45000]	Loss: 0.1928	LR: 0.060795
Training Epoch: 1 [27904/45000]	Loss: 0.1855	LR: 0.061364
Training Epoch: 1 [28160/45000]	Loss: 0.2339	LR: 0.061932
Training Epoch: 1 [28416/45000]	Loss: 0.1780	LR: 0.062500
Training Epoch: 1 [28672/45000]	Loss: 0.2412	LR: 0.063068
Training Epoch: 1 [28928/45000]	Loss: 0.1666	LR: 0.063636
Training Epoch: 1 [29184/45000]	Loss: 0.1171	LR: 0.064205
Training Epoch: 1 [29440/45000]	Loss: 0.2267	LR: 0.064773
Training Epoch: 1 [29696/45000]	Loss: 0.1831	LR: 0.065341
Training Epoch: 1 [29952/45000]	Loss: 0.1319	LR: 0.065909
Training Epoch: 1 [30208/45000]	Loss: 0.2749	LR: 0.066477
Training Epoch: 1 [30464/45000]	Loss: 0.1472	LR: 0.067045
Training Epoch: 1 [30720/45000]	Loss: 0.2334	LR: 0.067614
Training Epoch: 1 [30976/45000]	Loss: 0.2199	LR: 0.068182
Training Epoch: 1 [31232/45000]	Loss: 0.2520	LR: 0.068750
Training Epoch: 1 [31488/45000]	Loss: 0.1702	LR: 0.069318
Training Epoch: 1 [31744/45000]	Loss: 0.3719	LR: 0.069886
Training Epoch: 1 [32000/45000]	Loss: 0.2475	LR: 0.070455
Training Epoch: 1 [32256/45000]	Loss: 0.1633	LR: 0.071023
Training Epoch: 1 [32512/45000]	Loss: 0.3728	LR: 0.071591
Training Epoch: 1 [32768/45000]	Loss: 0.2879	LR: 0.072159
Training Epoch: 1 [33024/45000]	Loss: 0.2013	LR: 0.072727
Training Epoch: 1 [33280/45000]	Loss: 0.1922	LR: 0.073295
Training Epoch: 1 [33536/45000]	Loss: 0.1735	LR: 0.073864
Training Epoch: 1 [33792/45000]	Loss: 0.3300	LR: 0.074432
Training Epoch: 1 [34048/45000]	Loss: 0.1904	LR: 0.075000
Training Epoch: 1 [34304/45000]	Loss: 0.2240	LR: 0.075568
Training Epoch: 1 [34560/45000]	Loss: 0.3323	LR: 0.076136
Training Epoch: 1 [34816/45000]	Loss: 0.1492	LR: 0.076705
Training Epoch: 1 [35072/45000]	Loss: 0.1772	LR: 0.077273
Training Epoch: 1 [35328/45000]	Loss: 0.1544	LR: 0.077841
Training Epoch: 1 [35584/45000]	Loss: 0.2126	LR: 0.078409
Training Epoch: 1 [35840/45000]	Loss: 0.1901	LR: 0.078977
Training Epoch: 1 [36096/45000]	Loss: 0.1816	LR: 0.079545
Training Epoch: 1 [36352/45000]	Loss: 0.2185	LR: 0.080114
Training Epoch: 1 [36608/45000]	Loss: 0.2607	LR: 0.080682
Training Epoch: 1 [36864/45000]	Loss: 0.1755	LR: 0.081250
Training Epoch: 1 [37120/45000]	Loss: 0.1062	LR: 0.081818
Training Epoch: 1 [37376/45000]	Loss: 0.2014	LR: 0.082386
Training Epoch: 1 [37632/45000]	Loss: 0.1543	LR: 0.082955
Training Epoch: 1 [37888/45000]	Loss: 0.1607	LR: 0.083523
Training Epoch: 1 [38144/45000]	Loss: 0.2195	LR: 0.084091
Training Epoch: 1 [38400/45000]	Loss: 0.2175	LR: 0.084659
Training Epoch: 1 [38656/45000]	Loss: 0.1780	LR: 0.085227
Training Epoch: 1 [38912/45000]	Loss: 0.2890	LR: 0.085795
Training Epoch: 1 [39168/45000]	Loss: 0.4736	LR: 0.086364
Training Epoch: 1 [39424/45000]	Loss: 0.3040	LR: 0.086932
Training Epoch: 1 [39680/45000]	Loss: 0.3507	LR: 0.087500
Training Epoch: 1 [39936/45000]	Loss: 0.2496	LR: 0.088068
Training Epoch: 1 [40192/45000]	Loss: 0.3704	LR: 0.088636
Training Epoch: 1 [40448/45000]	Loss: 0.2218	LR: 0.089205
Training Epoch: 1 [40704/45000]	Loss: 0.3198	LR: 0.089773
Training Epoch: 1 [40960/45000]	Loss: 0.3970	LR: 0.090341
Training Epoch: 1 [41216/45000]	Loss: 0.2757	LR: 0.090909
Training Epoch: 1 [41472/45000]	Loss: 0.2117	LR: 0.091477
Training Epoch: 1 [41728/45000]	Loss: 0.2741	LR: 0.092045
Training Epoch: 1 [41984/45000]	Loss: 0.3238	LR: 0.092614
Training Epoch: 1 [42240/45000]	Loss: 0.2947	LR: 0.093182
Training Epoch: 1 [42496/45000]	Loss: 0.2189	LR: 0.093750
Training Epoch: 1 [42752/45000]	Loss: 0.2439	LR: 0.094318
Training Epoch: 1 [43008/45000]	Loss: 0.2367	LR: 0.094886
Training Epoch: 1 [43264/45000]	Loss: 0.4029	LR: 0.095455
Training Epoch: 1 [43520/45000]	Loss: 0.3012	LR: 0.096023
Training Epoch: 1 [43776/45000]	Loss: 0.2884	LR: 0.096591
Training Epoch: 1 [44032/45000]	Loss: 0.4188	LR: 0.097159
Training Epoch: 1 [44288/45000]	Loss: 1.7915	LR: 0.097727
Training Epoch: 1 [44544/45000]	Loss: 0.8842	LR: 0.098295
Training Epoch: 1 [44800/45000]	Loss: 0.6419	LR: 0.098864
Training Epoch: 1 [45000/45000]	Loss: 1.0059	LR: 0.099432
Epoch 1 - Average Train Loss: 0.4036, Train Accuracy: 0.8708
Epoch 1 training time consumed: 326.06s
Evaluating Network.....
Test set: Epoch: 1, Average loss: 0.0029, Accuracy: 0.7750, Time consumed:25.50s
Saving weights file to checkpoint/retrain/ViT/Thursday_17_July_2025_00h_03m_36s/ViT-Cifar10-seed1-ret100-1-best.pth
Training Epoch: 2 [256/45000]	Loss: 0.9496	LR: 0.100000
Training Epoch: 2 [512/45000]	Loss: 0.8780	LR: 0.100000
Training Epoch: 2 [768/45000]	Loss: 0.9485	LR: 0.100000
Training Epoch: 2 [1024/45000]	Loss: 1.0026	LR: 0.100000
Training Epoch: 2 [1280/45000]	Loss: 3.4604	LR: 0.100000
Training Epoch: 2 [1536/45000]	Loss: 2.8888	LR: 0.100000
Training Epoch: 2 [1792/45000]	Loss: 2.6799	LR: 0.100000
Training Epoch: 2 [2048/45000]	Loss: 2.5660	LR: 0.100000
Training Epoch: 2 [2304/45000]	Loss: 2.5286	LR: 0.100000
Training Epoch: 2 [2560/45000]	Loss: 2.3540	LR: 0.100000
Training Epoch: 2 [2816/45000]	Loss: 2.4440	LR: 0.100000
Training Epoch: 2 [3072/45000]	Loss: 2.3697	LR: 0.100000
Training Epoch: 2 [3328/45000]	Loss: 2.3758	LR: 0.100000
Training Epoch: 2 [3584/45000]	Loss: 2.3470	LR: 0.100000
Training Epoch: 2 [3840/45000]	Loss: 2.3465	LR: 0.100000
Training Epoch: 2 [4096/45000]	Loss: 2.2789	LR: 0.100000
Training Epoch: 2 [4352/45000]	Loss: 2.2994	LR: 0.100000
Training Epoch: 2 [4608/45000]	Loss: 2.2313	LR: 0.100000
Training Epoch: 2 [4864/45000]	Loss: 2.2268	LR: 0.100000
Training Epoch: 2 [5120/45000]	Loss: 2.2253	LR: 0.100000
Training Epoch: 2 [5376/45000]	Loss: 2.3100	LR: 0.100000
Training Epoch: 2 [5632/45000]	Loss: 2.2291	LR: 0.100000
Training Epoch: 2 [5888/45000]	Loss: 2.2488	LR: 0.100000
Training Epoch: 2 [6144/45000]	Loss: 2.1974	LR: 0.100000
Training Epoch: 2 [6400/45000]	Loss: 2.2794	LR: 0.100000
Training Epoch: 2 [6656/45000]	Loss: 2.2186	LR: 0.100000
Training Epoch: 2 [6912/45000]	Loss: 2.2104	LR: 0.100000
Training Epoch: 2 [7168/45000]	Loss: 2.1932	LR: 0.100000
Training Epoch: 2 [7424/45000]	Loss: 2.1498	LR: 0.100000
Training Epoch: 2 [7680/45000]	Loss: 2.1790	LR: 0.100000
Training Epoch: 2 [7936/45000]	Loss: 2.1697	LR: 0.100000
Training Epoch: 2 [8192/45000]	Loss: 2.1931	LR: 0.100000
Training Epoch: 2 [8448/45000]	Loss: 2.1334	LR: 0.100000
Training Epoch: 2 [8704/45000]	Loss: 2.1300	LR: 0.100000
Training Epoch: 2 [8960/45000]	Loss: 2.1586	LR: 0.100000
Training Epoch: 2 [9216/45000]	Loss: 2.1427	LR: 0.100000
Training Epoch: 2 [9472/45000]	Loss: 2.1429	LR: 0.100000
Training Epoch: 2 [9728/45000]	Loss: 2.1256	LR: 0.100000
Training Epoch: 2 [9984/45000]	Loss: 2.0858	LR: 0.100000
Training Epoch: 2 [10240/45000]	Loss: 2.1659	LR: 0.100000
Training Epoch: 2 [10496/45000]	Loss: 2.0862	LR: 0.100000
Training Epoch: 2 [10752/45000]	Loss: 2.2093	LR: 0.100000
Training Epoch: 2 [11008/45000]	Loss: 2.1088	LR: 0.100000
Training Epoch: 2 [11264/45000]	Loss: 2.0911	LR: 0.100000
Training Epoch: 2 [11520/45000]	Loss: 2.1039	LR: 0.100000
Training Epoch: 2 [11776/45000]	Loss: 2.1520	LR: 0.100000
Training Epoch: 2 [12032/45000]	Loss: 2.2435	LR: 0.100000
Training Epoch: 2 [12288/45000]	Loss: 2.0149	LR: 0.100000
Training Epoch: 2 [12544/45000]	Loss: 2.2223	LR: 0.100000
Training Epoch: 2 [12800/45000]	Loss: 2.0926	LR: 0.100000
Training Epoch: 2 [13056/45000]	Loss: 2.1126	LR: 0.100000
Training Epoch: 2 [13312/45000]	Loss: 2.0292	LR: 0.100000
Training Epoch: 2 [13568/45000]	Loss: 1.9338	LR: 0.100000
Training Epoch: 2 [13824/45000]	Loss: 2.0720	LR: 0.100000
Training Epoch: 2 [14080/45000]	Loss: 2.0592	LR: 0.100000
Training Epoch: 2 [14336/45000]	Loss: 2.0663	LR: 0.100000
Training Epoch: 2 [14592/45000]	Loss: 2.0093	LR: 0.100000
Training Epoch: 2 [14848/45000]	Loss: 2.0357	LR: 0.100000
Training Epoch: 2 [15104/45000]	Loss: 2.0703	LR: 0.100000
Training Epoch: 2 [15360/45000]	Loss: 1.9483	LR: 0.100000
Training Epoch: 2 [15616/45000]	Loss: 1.9813	LR: 0.100000
Training Epoch: 2 [15872/45000]	Loss: 1.9713	LR: 0.100000
Training Epoch: 2 [16128/45000]	Loss: 2.0593	LR: 0.100000
Training Epoch: 2 [16384/45000]	Loss: 1.9783	LR: 0.100000
Training Epoch: 2 [16640/45000]	Loss: 1.9597	LR: 0.100000
Training Epoch: 2 [16896/45000]	Loss: 1.9712	LR: 0.100000
Training Epoch: 2 [17152/45000]	Loss: 2.0397	LR: 0.100000
Training Epoch: 2 [17408/45000]	Loss: 1.9631	LR: 0.100000
Training Epoch: 2 [17664/45000]	Loss: 2.0469	LR: 0.100000
Training Epoch: 2 [17920/45000]	Loss: 2.0565	LR: 0.100000
Training Epoch: 2 [18176/45000]	Loss: 1.9691	LR: 0.100000
Training Epoch: 2 [18432/45000]	Loss: 2.0618	LR: 0.100000
Training Epoch: 2 [18688/45000]	Loss: 2.0223	LR: 0.100000
Training Epoch: 2 [18944/45000]	Loss: 2.0238	LR: 0.100000
Training Epoch: 2 [19200/45000]	Loss: 1.9755	LR: 0.100000
Training Epoch: 2 [19456/45000]	Loss: 2.0830	LR: 0.100000
Training Epoch: 2 [19712/45000]	Loss: 1.9565	LR: 0.100000
Training Epoch: 2 [19968/45000]	Loss: 2.0259	LR: 0.100000
Training Epoch: 2 [20224/45000]	Loss: 1.9938	LR: 0.100000
Training Epoch: 2 [20480/45000]	Loss: 1.9119	LR: 0.100000
Training Epoch: 2 [20736/45000]	Loss: 1.8932	LR: 0.100000
Training Epoch: 2 [20992/45000]	Loss: 2.0295	LR: 0.100000
Training Epoch: 2 [21248/45000]	Loss: 1.9914	LR: 0.100000
Training Epoch: 2 [21504/45000]	Loss: 1.9207	LR: 0.100000
Training Epoch: 2 [21760/45000]	Loss: 2.0398	LR: 0.100000
Training Epoch: 2 [22016/45000]	Loss: 1.9513	LR: 0.100000
Training Epoch: 2 [22272/45000]	Loss: 1.9240	LR: 0.100000
Training Epoch: 2 [22528/45000]	Loss: 2.0012	LR: 0.100000
Training Epoch: 2 [22784/45000]	Loss: 2.0394	LR: 0.100000
Training Epoch: 2 [23040/45000]	Loss: 1.8986	LR: 0.100000
Training Epoch: 2 [23296/45000]	Loss: 2.0445	LR: 0.100000
Training Epoch: 2 [23552/45000]	Loss: 1.9965	LR: 0.100000
Training Epoch: 2 [23808/45000]	Loss: 1.9879	LR: 0.100000
Training Epoch: 2 [24064/45000]	Loss: 1.9523	LR: 0.100000
Training Epoch: 2 [24320/45000]	Loss: 1.8955	LR: 0.100000
Training Epoch: 2 [24576/45000]	Loss: 1.9651	LR: 0.100000
Training Epoch: 2 [24832/45000]	Loss: 1.8683	LR: 0.100000
Training Epoch: 2 [25088/45000]	Loss: 1.9261	LR: 0.100000
Training Epoch: 2 [25344/45000]	Loss: 1.8534	LR: 0.100000
Training Epoch: 2 [25600/45000]	Loss: 1.8612	LR: 0.100000
Training Epoch: 2 [25856/45000]	Loss: 1.9152	LR: 0.100000
Training Epoch: 2 [26112/45000]	Loss: 1.8533	LR: 0.100000
Training Epoch: 2 [26368/45000]	Loss: 1.9019	LR: 0.100000
Training Epoch: 2 [26624/45000]	Loss: 1.9332	LR: 0.100000
Training Epoch: 2 [26880/45000]	Loss: 1.9042	LR: 0.100000
Training Epoch: 2 [27136/45000]	Loss: 1.9822	LR: 0.100000
Training Epoch: 2 [27392/45000]	Loss: 1.8872	LR: 0.100000
Training Epoch: 2 [27648/45000]	Loss: 1.8967	LR: 0.100000
Training Epoch: 2 [27904/45000]	Loss: 1.8292	LR: 0.100000
Training Epoch: 2 [28160/45000]	Loss: 1.9457	LR: 0.100000
Training Epoch: 2 [28416/45000]	Loss: 1.8603	LR: 0.100000
Training Epoch: 2 [28672/45000]	Loss: 1.9387	LR: 0.100000
Training Epoch: 2 [28928/45000]	Loss: 1.9175	LR: 0.100000
Training Epoch: 2 [29184/45000]	Loss: 1.9250	LR: 0.100000
Training Epoch: 2 [29440/45000]	Loss: 1.9795	LR: 0.100000
Training Epoch: 2 [29696/45000]	Loss: 1.8957	LR: 0.100000
Training Epoch: 2 [29952/45000]	Loss: 1.8955	LR: 0.100000
Training Epoch: 2 [30208/45000]	Loss: 1.9141	LR: 0.100000
Training Epoch: 2 [30464/45000]	Loss: 1.9229	LR: 0.100000
Training Epoch: 2 [30720/45000]	Loss: 1.9311	LR: 0.100000
Training Epoch: 2 [30976/45000]	Loss: 1.9722	LR: 0.100000
Training Epoch: 2 [31232/45000]	Loss: 1.8652	LR: 0.100000
Training Epoch: 2 [31488/45000]	Loss: 1.9464	LR: 0.100000
Training Epoch: 2 [31744/45000]	Loss: 2.0081	LR: 0.100000
Training Epoch: 2 [32000/45000]	Loss: 1.8690	LR: 0.100000
Training Epoch: 2 [32256/45000]	Loss: 1.8656	LR: 0.100000
Training Epoch: 2 [32512/45000]	Loss: 1.9147	LR: 0.100000
Training Epoch: 2 [32768/45000]	Loss: 1.8463	LR: 0.100000
Training Epoch: 2 [33024/45000]	Loss: 1.8941	LR: 0.100000
Training Epoch: 2 [33280/45000]	Loss: 1.9581	LR: 0.100000
Training Epoch: 2 [33536/45000]	Loss: 1.8339	LR: 0.100000
Training Epoch: 2 [33792/45000]	Loss: 1.7939	LR: 0.100000
Training Epoch: 2 [34048/45000]	Loss: 1.7992	LR: 0.100000
Training Epoch: 2 [34304/45000]	Loss: 1.9212	LR: 0.100000
Training Epoch: 2 [34560/45000]	Loss: 1.8938	LR: 0.100000
Training Epoch: 2 [34816/45000]	Loss: 1.9383	LR: 0.100000
Training Epoch: 2 [35072/45000]	Loss: 1.7921	LR: 0.100000
Training Epoch: 2 [35328/45000]	Loss: 1.7505	LR: 0.100000
Training Epoch: 2 [35584/45000]	Loss: 1.8719	LR: 0.100000
Training Epoch: 2 [35840/45000]	Loss: 1.8334	LR: 0.100000
Training Epoch: 2 [36096/45000]	Loss: 1.7688	LR: 0.100000
Training Epoch: 2 [36352/45000]	Loss: 1.7231	LR: 0.100000
Training Epoch: 2 [36608/45000]	Loss: 1.8594	LR: 0.100000
Training Epoch: 2 [36864/45000]	Loss: 1.7070	LR: 0.100000
Training Epoch: 2 [37120/45000]	Loss: 1.8490	LR: 0.100000
Training Epoch: 2 [37376/45000]	Loss: 1.9157	LR: 0.100000
Training Epoch: 2 [37632/45000]	Loss: 1.7757	LR: 0.100000
Training Epoch: 2 [37888/45000]	Loss: 1.7405	LR: 0.100000
Training Epoch: 2 [38144/45000]	Loss: 1.7614	LR: 0.100000
Training Epoch: 2 [38400/45000]	Loss: 1.7687	LR: 0.100000
Training Epoch: 2 [38656/45000]	Loss: 1.8186	LR: 0.100000
Training Epoch: 2 [38912/45000]	Loss: 1.7845	LR: 0.100000
Training Epoch: 2 [39168/45000]	Loss: 1.7820	LR: 0.100000
Training Epoch: 2 [39424/45000]	Loss: 1.8239	LR: 0.100000
Training Epoch: 2 [39680/45000]	Loss: 1.7065	LR: 0.100000
Training Epoch: 2 [39936/45000]	Loss: 1.7567	LR: 0.100000
Training Epoch: 2 [40192/45000]	Loss: 1.7884	LR: 0.100000
Training Epoch: 2 [40448/45000]	Loss: 1.7818	LR: 0.100000
Training Epoch: 2 [40704/45000]	Loss: 1.7860	LR: 0.100000
Training Epoch: 2 [40960/45000]	Loss: 1.7548	LR: 0.100000
Training Epoch: 2 [41216/45000]	Loss: 1.7906	LR: 0.100000
Training Epoch: 2 [41472/45000]	Loss: 1.7391	LR: 0.100000
Training Epoch: 2 [41728/45000]	Loss: 1.8060	LR: 0.100000
Training Epoch: 2 [41984/45000]	Loss: 1.7186	LR: 0.100000
Training Epoch: 2 [42240/45000]	Loss: 1.7010	LR: 0.100000
Training Epoch: 2 [42496/45000]	Loss: 1.7459	LR: 0.100000
Training Epoch: 2 [42752/45000]	Loss: 1.6357	LR: 0.100000
Training Epoch: 2 [43008/45000]	Loss: 1.8043	LR: 0.100000
Training Epoch: 2 [43264/45000]	Loss: 1.8935	LR: 0.100000
Training Epoch: 2 [43520/45000]	Loss: 1.7312	LR: 0.100000
Training Epoch: 2 [43776/45000]	Loss: 1.7563	LR: 0.100000
Training Epoch: 2 [44032/45000]	Loss: 1.6927	LR: 0.100000
Training Epoch: 2 [44288/45000]	Loss: 1.7374	LR: 0.100000
Training Epoch: 2 [44544/45000]	Loss: 1.6901	LR: 0.100000
Training Epoch: 2 [44800/45000]	Loss: 1.7703	LR: 0.100000
Training Epoch: 2 [45000/45000]	Loss: 1.7857	LR: 0.100000
Epoch 2 - Average Train Loss: 1.9733, Train Accuracy: 0.2651
Epoch 2 training time consumed: 324.57s
Evaluating Network.....
Test set: Epoch: 2, Average loss: 0.0071, Accuracy: 0.3536, Time consumed:23.43s
Training Epoch: 3 [256/45000]	Loss: 1.6680	LR: 0.100000
Training Epoch: 3 [512/45000]	Loss: 1.7495	LR: 0.100000
Training Epoch: 3 [768/45000]	Loss: 1.6855	LR: 0.100000
Training Epoch: 3 [1024/45000]	Loss: 1.6983	LR: 0.100000
Training Epoch: 3 [1280/45000]	Loss: 1.7940	LR: 0.100000
Training Epoch: 3 [1536/45000]	Loss: 1.6901	LR: 0.100000
Training Epoch: 3 [1792/45000]	Loss: 1.8907	LR: 0.100000
Training Epoch: 3 [2048/45000]	Loss: 1.6743	LR: 0.100000
Training Epoch: 3 [2304/45000]	Loss: 1.7380	LR: 0.100000
Training Epoch: 3 [2560/45000]	Loss: 1.6917	LR: 0.100000
Training Epoch: 3 [2816/45000]	Loss: 1.8546	LR: 0.100000
Training Epoch: 3 [3072/45000]	Loss: 1.7227	LR: 0.100000
Training Epoch: 3 [3328/45000]	Loss: 1.6104	LR: 0.100000
Training Epoch: 3 [3584/45000]	Loss: 1.7693	LR: 0.100000
Training Epoch: 3 [3840/45000]	Loss: 1.7577	LR: 0.100000
Training Epoch: 3 [4096/45000]	Loss: 1.7447	LR: 0.100000
Training Epoch: 3 [4352/45000]	Loss: 1.7067	LR: 0.100000
Training Epoch: 3 [4608/45000]	Loss: 1.6484	LR: 0.100000
Training Epoch: 3 [4864/45000]	Loss: 1.6151	LR: 0.100000
Training Epoch: 3 [5120/45000]	Loss: 1.6608	LR: 0.100000
Training Epoch: 3 [5376/45000]	Loss: 1.6611	LR: 0.100000
Training Epoch: 3 [5632/45000]	Loss: 1.6939	LR: 0.100000
Training Epoch: 3 [5888/45000]	Loss: 1.6073	LR: 0.100000
Training Epoch: 3 [6144/45000]	Loss: 1.7393	LR: 0.100000
Training Epoch: 3 [6400/45000]	Loss: 1.6236	LR: 0.100000
Training Epoch: 3 [6656/45000]	Loss: 1.6375	LR: 0.100000
Training Epoch: 3 [6912/45000]	Loss: 1.6421	LR: 0.100000
Training Epoch: 3 [7168/45000]	Loss: 1.7416	LR: 0.100000
Training Epoch: 3 [7424/45000]	Loss: 1.6028	LR: 0.100000
Training Epoch: 3 [7680/45000]	Loss: 1.6661	LR: 0.100000
Training Epoch: 3 [7936/45000]	Loss: 1.5363	LR: 0.100000
Training Epoch: 3 [8192/45000]	Loss: 1.6019	LR: 0.100000
Training Epoch: 3 [8448/45000]	Loss: 1.6040	LR: 0.100000
Training Epoch: 3 [8704/45000]	Loss: 1.7043	LR: 0.100000
Training Epoch: 3 [8960/45000]	Loss: 1.6062	LR: 0.100000
Training Epoch: 3 [9216/45000]	Loss: 1.7024	LR: 0.100000
Training Epoch: 3 [9472/45000]	Loss: 1.6664	LR: 0.100000
Training Epoch: 3 [9728/45000]	Loss: 1.6789	LR: 0.100000
Training Epoch: 3 [9984/45000]	Loss: 1.6522	LR: 0.100000
Training Epoch: 3 [10240/45000]	Loss: 1.6414	LR: 0.100000
Training Epoch: 3 [10496/45000]	Loss: 1.5851	LR: 0.100000
Training Epoch: 3 [10752/45000]	Loss: 1.6828	LR: 0.100000
Training Epoch: 3 [11008/45000]	Loss: 1.7618	LR: 0.100000
Training Epoch: 3 [11264/45000]	Loss: 1.6320	LR: 0.100000
Training Epoch: 3 [11520/45000]	Loss: 1.6090	LR: 0.100000
Training Epoch: 3 [11776/45000]	Loss: 1.6831	LR: 0.100000
Training Epoch: 3 [12032/45000]	Loss: 1.6072	LR: 0.100000
Training Epoch: 3 [12288/45000]	Loss: 1.5336	LR: 0.100000
Training Epoch: 3 [12544/45000]	Loss: 1.6273	LR: 0.100000
Training Epoch: 3 [12800/45000]	Loss: 1.6996	LR: 0.100000
Training Epoch: 3 [13056/45000]	Loss: 1.5676	LR: 0.100000
Training Epoch: 3 [13312/45000]	Loss: 1.5283	LR: 0.100000
Training Epoch: 3 [13568/45000]	Loss: 1.6075	LR: 0.100000
Training Epoch: 3 [13824/45000]	Loss: 1.6095	LR: 0.100000
Training Epoch: 3 [14080/45000]	Loss: 1.5252	LR: 0.100000
Training Epoch: 3 [14336/45000]	Loss: 1.5804	LR: 0.100000
Training Epoch: 3 [14592/45000]	Loss: 1.5172	LR: 0.100000
Training Epoch: 3 [14848/45000]	Loss: 1.5698	LR: 0.100000
Training Epoch: 3 [15104/45000]	Loss: 1.5233	LR: 0.100000
Training Epoch: 3 [15360/45000]	Loss: 1.5627	LR: 0.100000
Training Epoch: 3 [15616/45000]	Loss: 1.4965	LR: 0.100000
Training Epoch: 3 [15872/45000]	Loss: 1.5277	LR: 0.100000
Training Epoch: 3 [16128/45000]	Loss: 1.5838	LR: 0.100000
Training Epoch: 3 [16384/45000]	Loss: 1.5732	LR: 0.100000
Training Epoch: 3 [16640/45000]	Loss: 1.5254	LR: 0.100000
Training Epoch: 3 [16896/45000]	Loss: 1.4435	LR: 0.100000
Training Epoch: 3 [17152/45000]	Loss: 1.4161	LR: 0.100000
Training Epoch: 3 [17408/45000]	Loss: 1.5113	LR: 0.100000
Training Epoch: 3 [17664/45000]	Loss: 1.3822	LR: 0.100000
Training Epoch: 3 [17920/45000]	Loss: 1.4842	LR: 0.100000
Training Epoch: 3 [18176/45000]	Loss: 1.5543	LR: 0.100000
Training Epoch: 3 [18432/45000]	Loss: 1.4238	LR: 0.100000
Training Epoch: 3 [18688/45000]	Loss: 1.4956	LR: 0.100000
Training Epoch: 3 [18944/45000]	Loss: 1.5252	LR: 0.100000
Training Epoch: 3 [19200/45000]	Loss: 1.3990	LR: 0.100000
Training Epoch: 3 [19456/45000]	Loss: 1.6049	LR: 0.100000
Training Epoch: 3 [19712/45000]	Loss: 1.5070	LR: 0.100000
Training Epoch: 3 [19968/45000]	Loss: 1.3497	LR: 0.100000
Training Epoch: 3 [20224/45000]	Loss: 1.4077	LR: 0.100000
Training Epoch: 3 [20480/45000]	Loss: 1.4350	LR: 0.100000
Training Epoch: 3 [20736/45000]	Loss: 1.4840	LR: 0.100000
Training Epoch: 3 [20992/45000]	Loss: 1.5008	LR: 0.100000
Training Epoch: 3 [21248/45000]	Loss: 1.3955	LR: 0.100000
Training Epoch: 3 [21504/45000]	Loss: 1.4370	LR: 0.100000
Training Epoch: 3 [21760/45000]	Loss: 1.2944	LR: 0.100000
Training Epoch: 3 [22016/45000]	Loss: 1.3427	LR: 0.100000
Training Epoch: 3 [22272/45000]	Loss: 1.2393	LR: 0.100000
Training Epoch: 3 [22528/45000]	Loss: 1.3808	LR: 0.100000
Training Epoch: 3 [22784/45000]	Loss: 1.3990	LR: 0.100000
Training Epoch: 3 [23040/45000]	Loss: 1.3522	LR: 0.100000
Training Epoch: 3 [23296/45000]	Loss: 1.4776	LR: 0.100000
Training Epoch: 3 [23552/45000]	Loss: 1.4140	LR: 0.100000
Training Epoch: 3 [23808/45000]	Loss: 1.2421	LR: 0.100000
Training Epoch: 3 [24064/45000]	Loss: 1.3812	LR: 0.100000
Training Epoch: 3 [24320/45000]	Loss: 1.3678	LR: 0.100000
Training Epoch: 3 [24576/45000]	Loss: 1.2597	LR: 0.100000
Training Epoch: 3 [24832/45000]	Loss: 1.3869	LR: 0.100000
Training Epoch: 3 [25088/45000]	Loss: 1.3866	LR: 0.100000
Training Epoch: 3 [25344/45000]	Loss: 1.5012	LR: 0.100000
Training Epoch: 3 [25600/45000]	Loss: 1.3839	LR: 0.100000
Training Epoch: 3 [25856/45000]	Loss: 1.4046	LR: 0.100000
Training Epoch: 3 [26112/45000]	Loss: 1.3316	LR: 0.100000
Training Epoch: 3 [26368/45000]	Loss: 1.4838	LR: 0.100000
Training Epoch: 3 [26624/45000]	Loss: 1.3962	LR: 0.100000
Training Epoch: 3 [26880/45000]	Loss: 1.3545	LR: 0.100000
Training Epoch: 3 [27136/45000]	Loss: 1.3731	LR: 0.100000
Training Epoch: 3 [27392/45000]	Loss: 1.4285	LR: 0.100000
Training Epoch: 3 [27648/45000]	Loss: 1.3208	LR: 0.100000
Training Epoch: 3 [27904/45000]	Loss: 1.3190	LR: 0.100000
Training Epoch: 3 [28160/45000]	Loss: 1.3549	LR: 0.100000
Training Epoch: 3 [28416/45000]	Loss: 1.2204	LR: 0.100000
Training Epoch: 3 [28672/45000]	Loss: 1.4330	LR: 0.100000
Training Epoch: 3 [28928/45000]	Loss: 1.3678	LR: 0.100000
Training Epoch: 3 [29184/45000]	Loss: 1.4354	LR: 0.100000
Training Epoch: 3 [29440/45000]	Loss: 1.3515	LR: 0.100000
Training Epoch: 3 [29696/45000]	Loss: 1.3240	LR: 0.100000
Training Epoch: 3 [29952/45000]	Loss: 1.2980	LR: 0.100000
Training Epoch: 3 [30208/45000]	Loss: 1.3400	LR: 0.100000
Training Epoch: 3 [30464/45000]	Loss: 1.3808	LR: 0.100000
Training Epoch: 3 [30720/45000]	Loss: 1.5235	LR: 0.100000
Training Epoch: 3 [30976/45000]	Loss: 1.2838	LR: 0.100000
Training Epoch: 3 [31232/45000]	Loss: 1.3437	LR: 0.100000
Training Epoch: 3 [31488/45000]	Loss: 1.3074	LR: 0.100000
Training Epoch: 3 [31744/45000]	Loss: 1.2821	LR: 0.100000
Training Epoch: 3 [32000/45000]	Loss: 1.1898	LR: 0.100000
Training Epoch: 3 [32256/45000]	Loss: 1.3219	LR: 0.100000
Training Epoch: 3 [32512/45000]	Loss: 1.4967	LR: 0.100000
Training Epoch: 3 [32768/45000]	Loss: 1.3351	LR: 0.100000
Training Epoch: 3 [33024/45000]	Loss: 1.4593	LR: 0.100000
Training Epoch: 3 [33280/45000]	Loss: 1.3203	LR: 0.100000
Training Epoch: 3 [33536/45000]	Loss: 1.2433	LR: 0.100000
Training Epoch: 3 [33792/45000]	Loss: 1.3580	LR: 0.100000
Training Epoch: 3 [34048/45000]	Loss: 1.3368	LR: 0.100000
Training Epoch: 3 [34304/45000]	Loss: 1.3302	LR: 0.100000
Training Epoch: 3 [34560/45000]	Loss: 1.2896	LR: 0.100000
Training Epoch: 3 [34816/45000]	Loss: 1.2934	LR: 0.100000
Training Epoch: 3 [35072/45000]	Loss: 1.4246	LR: 0.100000
Training Epoch: 3 [35328/45000]	Loss: 1.2931	LR: 0.100000
Training Epoch: 3 [35584/45000]	Loss: 1.4094	LR: 0.100000
Training Epoch: 3 [35840/45000]	Loss: 1.3030	LR: 0.100000
Training Epoch: 3 [36096/45000]	Loss: 1.3243	LR: 0.100000
Training Epoch: 3 [36352/45000]	Loss: 1.2110	LR: 0.100000
Training Epoch: 3 [36608/45000]	Loss: 1.1914	LR: 0.100000
Training Epoch: 3 [36864/45000]	Loss: 1.2498	LR: 0.100000
Training Epoch: 3 [37120/45000]	Loss: 1.2846	LR: 0.100000
Training Epoch: 3 [37376/45000]	Loss: 1.1925	LR: 0.100000
Training Epoch: 3 [37632/45000]	Loss: 1.1959	LR: 0.100000
Training Epoch: 3 [37888/45000]	Loss: 1.1088	LR: 0.100000
Training Epoch: 3 [38144/45000]	Loss: 1.2025	LR: 0.100000
Training Epoch: 3 [38400/45000]	Loss: 1.3480	LR: 0.100000
Training Epoch: 3 [38656/45000]	Loss: 1.0869	LR: 0.100000
Training Epoch: 3 [38912/45000]	Loss: 1.3169	LR: 0.100000
Training Epoch: 3 [39168/45000]	Loss: 1.1672	LR: 0.100000
Training Epoch: 3 [39424/45000]	Loss: 1.1802	LR: 0.100000
Training Epoch: 3 [39680/45000]	Loss: 1.1398	LR: 0.100000
Training Epoch: 3 [39936/45000]	Loss: 1.1823	LR: 0.100000
Training Epoch: 3 [40192/45000]	Loss: 1.1429	LR: 0.100000
Training Epoch: 3 [40448/45000]	Loss: 1.3859	LR: 0.100000
Training Epoch: 3 [40704/45000]	Loss: 1.0867	LR: 0.100000
Training Epoch: 3 [40960/45000]	Loss: 1.1596	LR: 0.100000
Training Epoch: 3 [41216/45000]	Loss: 1.1620	LR: 0.100000
Training Epoch: 3 [41472/45000]	Loss: 1.2752	LR: 0.100000
Training Epoch: 3 [41728/45000]	Loss: 1.1218	LR: 0.100000
Training Epoch: 3 [41984/45000]	Loss: 1.2419	LR: 0.100000
Training Epoch: 3 [42240/45000]	Loss: 1.1607	LR: 0.100000
Training Epoch: 3 [42496/45000]	Loss: 1.2259	LR: 0.100000
Training Epoch: 3 [42752/45000]	Loss: 1.1596	LR: 0.100000
Training Epoch: 3 [43008/45000]	Loss: 1.1238	LR: 0.100000
Training Epoch: 3 [43264/45000]	Loss: 1.1709	LR: 0.100000
Training Epoch: 3 [43520/45000]	Loss: 1.1605	LR: 0.100000
Training Epoch: 3 [43776/45000]	Loss: 1.0891	LR: 0.100000
Training Epoch: 3 [44032/45000]	Loss: 1.0083	LR: 0.100000
Training Epoch: 3 [44288/45000]	Loss: 1.1498	LR: 0.100000
Training Epoch: 3 [44544/45000]	Loss: 1.0404	LR: 0.100000
Training Epoch: 3 [44800/45000]	Loss: 1.0983	LR: 0.100000
Training Epoch: 3 [45000/45000]	Loss: 1.0200	LR: 0.100000
Epoch 3 - Average Train Loss: 1.4386, Train Accuracy: 0.4747
Epoch 3 training time consumed: 324.57s
Evaluating Network.....
Test set: Epoch: 3, Average loss: 0.0046, Accuracy: 0.5948, Time consumed:23.41s
Training Epoch: 4 [256/45000]	Loss: 1.2102	LR: 0.100000
Training Epoch: 4 [512/45000]	Loss: 1.1153	LR: 0.100000
Training Epoch: 4 [768/45000]	Loss: 1.0035	LR: 0.100000
Training Epoch: 4 [1024/45000]	Loss: 1.1105	LR: 0.100000
Training Epoch: 4 [1280/45000]	Loss: 0.9291	LR: 0.100000
Training Epoch: 4 [1536/45000]	Loss: 0.9307	LR: 0.100000
Training Epoch: 4 [1792/45000]	Loss: 1.1800	LR: 0.100000
Training Epoch: 4 [2048/45000]	Loss: 1.0945	LR: 0.100000
Training Epoch: 4 [2304/45000]	Loss: 1.3002	LR: 0.100000
Training Epoch: 4 [2560/45000]	Loss: 1.4083	LR: 0.100000
Training Epoch: 4 [2816/45000]	Loss: 1.2239	LR: 0.100000
Training Epoch: 4 [3072/45000]	Loss: 1.1076	LR: 0.100000
Training Epoch: 4 [3328/45000]	Loss: 1.0557	LR: 0.100000
Training Epoch: 4 [3584/45000]	Loss: 1.0888	LR: 0.100000
Training Epoch: 4 [3840/45000]	Loss: 1.2775	LR: 0.100000
Training Epoch: 4 [4096/45000]	Loss: 0.9952	LR: 0.100000
Training Epoch: 4 [4352/45000]	Loss: 1.1030	LR: 0.100000
Training Epoch: 4 [4608/45000]	Loss: 0.9966	LR: 0.100000
Training Epoch: 4 [4864/45000]	Loss: 0.9336	LR: 0.100000
Training Epoch: 4 [5120/45000]	Loss: 0.8808	LR: 0.100000
Training Epoch: 4 [5376/45000]	Loss: 0.9641	LR: 0.100000
Training Epoch: 4 [5632/45000]	Loss: 1.0827	LR: 0.100000
Training Epoch: 4 [5888/45000]	Loss: 0.9100	LR: 0.100000
Training Epoch: 4 [6144/45000]	Loss: 0.8995	LR: 0.100000
Training Epoch: 4 [6400/45000]	Loss: 0.8491	LR: 0.100000
Training Epoch: 4 [6656/45000]	Loss: 0.9536	LR: 0.100000
Training Epoch: 4 [6912/45000]	Loss: 0.8714	LR: 0.100000
Training Epoch: 4 [7168/45000]	Loss: 0.8548	LR: 0.100000
Training Epoch: 4 [7424/45000]	Loss: 0.8896	LR: 0.100000
Training Epoch: 4 [7680/45000]	Loss: 0.8332	LR: 0.100000
Training Epoch: 4 [7936/45000]	Loss: 0.9228	LR: 0.100000
Training Epoch: 4 [8192/45000]	Loss: 0.7811	LR: 0.100000
Training Epoch: 4 [8448/45000]	Loss: 0.8514	LR: 0.100000
Training Epoch: 4 [8704/45000]	Loss: 0.8280	LR: 0.100000
Training Epoch: 4 [8960/45000]	Loss: 0.7086	LR: 0.100000
Training Epoch: 4 [9216/45000]	Loss: 0.8216	LR: 0.100000
Training Epoch: 4 [9472/45000]	Loss: 0.7695	LR: 0.100000
Training Epoch: 4 [9728/45000]	Loss: 0.8015	LR: 0.100000
Training Epoch: 4 [9984/45000]	Loss: 0.8384	LR: 0.100000
Training Epoch: 4 [10240/45000]	Loss: 0.7574	LR: 0.100000
Training Epoch: 4 [10496/45000]	Loss: 0.7693	LR: 0.100000
Training Epoch: 4 [10752/45000]	Loss: 0.7338	LR: 0.100000
Training Epoch: 4 [11008/45000]	Loss: 0.7935	LR: 0.100000
Training Epoch: 4 [11264/45000]	Loss: 0.6830	LR: 0.100000
Training Epoch: 4 [11520/45000]	Loss: 0.6994	LR: 0.100000
Training Epoch: 4 [11776/45000]	Loss: 0.7956	LR: 0.100000
Training Epoch: 4 [12032/45000]	Loss: 0.7119	LR: 0.100000
Training Epoch: 4 [12288/45000]	Loss: 0.6683	LR: 0.100000
Training Epoch: 4 [12544/45000]	Loss: 0.6960	LR: 0.100000
Training Epoch: 4 [12800/45000]	Loss: 0.6076	LR: 0.100000
Training Epoch: 4 [13056/45000]	Loss: 0.6310	LR: 0.100000
Training Epoch: 4 [13312/45000]	Loss: 0.5787	LR: 0.100000
Training Epoch: 4 [13568/45000]	Loss: 0.7890	LR: 0.100000
Training Epoch: 4 [13824/45000]	Loss: 0.6858	LR: 0.100000
Training Epoch: 4 [14080/45000]	Loss: 0.5932	LR: 0.100000
Training Epoch: 4 [14336/45000]	Loss: 0.8651	LR: 0.100000
Training Epoch: 4 [14592/45000]	Loss: 0.8821	LR: 0.100000
Training Epoch: 4 [14848/45000]	Loss: 0.8793	LR: 0.100000
Training Epoch: 4 [15104/45000]	Loss: 0.8502	LR: 0.100000
Training Epoch: 4 [15360/45000]	Loss: 0.6311	LR: 0.100000
Training Epoch: 4 [15616/45000]	Loss: 0.6463	LR: 0.100000
Training Epoch: 4 [15872/45000]	Loss: 0.7407	LR: 0.100000
Training Epoch: 4 [16128/45000]	Loss: 0.7868	LR: 0.100000
Training Epoch: 4 [16384/45000]	Loss: 0.6726	LR: 0.100000
Training Epoch: 4 [16640/45000]	Loss: 0.7117	LR: 0.100000
Training Epoch: 4 [16896/45000]	Loss: 0.5773	LR: 0.100000
Training Epoch: 4 [17152/45000]	Loss: 0.5902	LR: 0.100000
Training Epoch: 4 [17408/45000]	Loss: 0.6191	LR: 0.100000
Training Epoch: 4 [17664/45000]	Loss: 0.7266	LR: 0.100000
Training Epoch: 4 [17920/45000]	Loss: 0.6768	LR: 0.100000
Training Epoch: 4 [18176/45000]	Loss: 0.5027	LR: 0.100000
Training Epoch: 4 [18432/45000]	Loss: 0.5707	LR: 0.100000
Training Epoch: 4 [18688/45000]	Loss: 0.5607	LR: 0.100000
Training Epoch: 4 [18944/45000]	Loss: 0.5642	LR: 0.100000
Training Epoch: 4 [19200/45000]	Loss: 0.5202	LR: 0.100000
Training Epoch: 4 [19456/45000]	Loss: 0.6686	LR: 0.100000
Training Epoch: 4 [19712/45000]	Loss: 0.8138	LR: 0.100000
Training Epoch: 4 [19968/45000]	Loss: 0.6324	LR: 0.100000
Training Epoch: 4 [20224/45000]	Loss: 0.8082	LR: 0.100000
Training Epoch: 4 [20480/45000]	Loss: 0.6143	LR: 0.100000
Training Epoch: 4 [20736/45000]	Loss: 0.7807	LR: 0.100000
Training Epoch: 4 [20992/45000]	Loss: 0.6560	LR: 0.100000
Training Epoch: 4 [21248/45000]	Loss: 0.8264	LR: 0.100000
Training Epoch: 4 [21504/45000]	Loss: 0.8686	LR: 0.100000
Training Epoch: 4 [21760/45000]	Loss: 0.7240	LR: 0.100000
Training Epoch: 4 [22016/45000]	Loss: 0.6659	LR: 0.100000
Training Epoch: 4 [22272/45000]	Loss: 0.6197	LR: 0.100000
Training Epoch: 4 [22528/45000]	Loss: 0.5609	LR: 0.100000
Training Epoch: 4 [22784/45000]	Loss: 0.6141	LR: 0.100000
Training Epoch: 4 [23040/45000]	Loss: 0.5523	LR: 0.100000
Training Epoch: 4 [23296/45000]	Loss: 0.5774	LR: 0.100000
Training Epoch: 4 [23552/45000]	Loss: 0.6399	LR: 0.100000
Training Epoch: 4 [23808/45000]	Loss: 0.4866	LR: 0.100000
Training Epoch: 4 [24064/45000]	Loss: 0.5815	LR: 0.100000
Training Epoch: 4 [24320/45000]	Loss: 0.5380	LR: 0.100000
Training Epoch: 4 [24576/45000]	Loss: 0.5175	LR: 0.100000
Training Epoch: 4 [24832/45000]	Loss: 0.3839	LR: 0.100000
Training Epoch: 4 [25088/45000]	Loss: 0.4194	LR: 0.100000
Training Epoch: 4 [25344/45000]	Loss: 0.4455	LR: 0.100000
Training Epoch: 4 [25600/45000]	Loss: 0.4582	LR: 0.100000
Training Epoch: 4 [25856/45000]	Loss: 0.5718	LR: 0.100000
Training Epoch: 4 [26112/45000]	Loss: 0.4179	LR: 0.100000
Training Epoch: 4 [26368/45000]	Loss: 0.4245	LR: 0.100000
Training Epoch: 4 [26624/45000]	Loss: 0.4137	LR: 0.100000
Training Epoch: 4 [26880/45000]	Loss: 0.4194	LR: 0.100000
Training Epoch: 4 [27136/45000]	Loss: 0.4083	LR: 0.100000
Training Epoch: 4 [27392/45000]	Loss: 0.3709	LR: 0.100000
Training Epoch: 4 [27648/45000]	Loss: 0.4185	LR: 0.100000
Training Epoch: 4 [27904/45000]	Loss: 0.3550	LR: 0.100000
Training Epoch: 4 [28160/45000]	Loss: 0.3864	LR: 0.100000
Training Epoch: 4 [28416/45000]	Loss: 0.3601	LR: 0.100000
Training Epoch: 4 [28672/45000]	Loss: 0.3984	LR: 0.100000
Training Epoch: 4 [28928/45000]	Loss: 0.5552	LR: 0.100000
Training Epoch: 4 [29184/45000]	Loss: 0.3931	LR: 0.100000
Training Epoch: 4 [29440/45000]	Loss: 0.5297	LR: 0.100000
Training Epoch: 4 [29696/45000]	Loss: 0.5739	LR: 0.100000
Training Epoch: 4 [29952/45000]	Loss: 0.4752	LR: 0.100000
Training Epoch: 4 [30208/45000]	Loss: 0.4339	LR: 0.100000
Training Epoch: 4 [30464/45000]	Loss: 0.4112	LR: 0.100000
Training Epoch: 4 [30720/45000]	Loss: 0.3592	LR: 0.100000
Training Epoch: 4 [30976/45000]	Loss: 0.3324	LR: 0.100000
Training Epoch: 4 [31232/45000]	Loss: 0.4040	LR: 0.100000
Training Epoch: 4 [31488/45000]	Loss: 0.4060	LR: 0.100000
Training Epoch: 4 [31744/45000]	Loss: 0.4450	LR: 0.100000
Training Epoch: 4 [32000/45000]	Loss: 0.5071	LR: 0.100000
Training Epoch: 4 [32256/45000]	Loss: 0.3355	LR: 0.100000
Training Epoch: 4 [32512/45000]	Loss: 0.5553	LR: 0.100000
Training Epoch: 4 [32768/45000]	Loss: 0.3930	LR: 0.100000
Training Epoch: 4 [33024/45000]	Loss: 0.4437	LR: 0.100000
Training Epoch: 4 [33280/45000]	Loss: 0.4967	LR: 0.100000
Training Epoch: 4 [33536/45000]	Loss: 0.4120	LR: 0.100000
Training Epoch: 4 [33792/45000]	Loss: 0.4123	LR: 0.100000
Training Epoch: 4 [34048/45000]	Loss: 0.4217	LR: 0.100000
Training Epoch: 4 [34304/45000]	Loss: 0.3558	LR: 0.100000
Training Epoch: 4 [34560/45000]	Loss: 0.3208	LR: 0.100000
Training Epoch: 4 [34816/45000]	Loss: 0.2840	LR: 0.100000
Training Epoch: 4 [35072/45000]	Loss: 0.4088	LR: 0.100000
Training Epoch: 4 [35328/45000]	Loss: 0.3779	LR: 0.100000
Training Epoch: 4 [35584/45000]	Loss: 0.4642	LR: 0.100000
Training Epoch: 4 [35840/45000]	Loss: 0.3074	LR: 0.100000
Training Epoch: 4 [36096/45000]	Loss: 0.3691	LR: 0.100000
Training Epoch: 4 [36352/45000]	Loss: 0.3459	LR: 0.100000
Training Epoch: 4 [36608/45000]	Loss: 0.3740	LR: 0.100000
Training Epoch: 4 [36864/45000]	Loss: 0.3370	LR: 0.100000
Training Epoch: 4 [37120/45000]	Loss: 0.3430	LR: 0.100000
Training Epoch: 4 [37376/45000]	Loss: 0.3091	LR: 0.100000
Training Epoch: 4 [37632/45000]	Loss: 0.4234	LR: 0.100000
Training Epoch: 4 [37888/45000]	Loss: 0.3330	LR: 0.100000
Training Epoch: 4 [38144/45000]	Loss: 0.3479	LR: 0.100000
Training Epoch: 4 [38400/45000]	Loss: 0.5351	LR: 0.100000
Training Epoch: 4 [38656/45000]	Loss: 0.3200	LR: 0.100000
Training Epoch: 4 [38912/45000]	Loss: 0.3357	LR: 0.100000
Training Epoch: 4 [39168/45000]	Loss: 0.3357	LR: 0.100000
Training Epoch: 4 [39424/45000]	Loss: 0.3744	LR: 0.100000
Training Epoch: 4 [39680/45000]	Loss: 0.3395	LR: 0.100000
Training Epoch: 4 [39936/45000]	Loss: 0.2701	LR: 0.100000
Training Epoch: 4 [40192/45000]	Loss: 0.3625	LR: 0.100000
Training Epoch: 4 [40448/45000]	Loss: 0.3039	LR: 0.100000
Training Epoch: 4 [40704/45000]	Loss: 0.2539	LR: 0.100000
Training Epoch: 4 [40960/45000]	Loss: 0.2697	LR: 0.100000
Training Epoch: 4 [41216/45000]	Loss: 0.3348	LR: 0.100000
Training Epoch: 4 [41472/45000]	Loss: 0.2819	LR: 0.100000
Training Epoch: 4 [41728/45000]	Loss: 0.3436	LR: 0.100000
Training Epoch: 4 [41984/45000]	Loss: 0.3386	LR: 0.100000
Training Epoch: 4 [42240/45000]	Loss: 0.4165	LR: 0.100000
Training Epoch: 4 [42496/45000]	Loss: 0.2398	LR: 0.100000
Training Epoch: 4 [42752/45000]	Loss: 0.3361	LR: 0.100000
Training Epoch: 4 [43008/45000]	Loss: 0.3593	LR: 0.100000
Training Epoch: 4 [43264/45000]	Loss: 0.2918	LR: 0.100000
Training Epoch: 4 [43520/45000]	Loss: 0.3431	LR: 0.100000
Training Epoch: 4 [43776/45000]	Loss: 0.2962	LR: 0.100000
Training Epoch: 4 [44032/45000]	Loss: 0.2697	LR: 0.100000
Training Epoch: 4 [44288/45000]	Loss: 0.3295	LR: 0.100000
Training Epoch: 4 [44544/45000]	Loss: 0.3258	LR: 0.100000
Training Epoch: 4 [44800/45000]	Loss: 0.3206	LR: 0.100000
Training Epoch: 4 [45000/45000]	Loss: 0.2771	LR: 0.100000
Epoch 4 - Average Train Loss: 0.6103, Train Accuracy: 0.7863
Epoch 4 training time consumed: 324.78s
Evaluating Network.....
Test set: Epoch: 4, Average loss: 0.0010, Accuracy: 0.9135, Time consumed:23.42s
Saving weights file to checkpoint/retrain/ViT/Thursday_17_July_2025_00h_03m_36s/ViT-Cifar10-seed1-ret100-4-best.pth
Training Epoch: 5 [256/45000]	Loss: 0.2632	LR: 0.100000
Training Epoch: 5 [512/45000]	Loss: 0.2921	LR: 0.100000
Training Epoch: 5 [768/45000]	Loss: 0.3575	LR: 0.100000
Training Epoch: 5 [1024/45000]	Loss: 0.2421	LR: 0.100000
Training Epoch: 5 [1280/45000]	Loss: 0.2315	LR: 0.100000
Training Epoch: 5 [1536/45000]	Loss: 0.3318	LR: 0.100000
Training Epoch: 5 [1792/45000]	Loss: 0.2211	LR: 0.100000
Training Epoch: 5 [2048/45000]	Loss: 0.2671	LR: 0.100000
Training Epoch: 5 [2304/45000]	Loss: 0.3465	LR: 0.100000
Training Epoch: 5 [2560/45000]	Loss: 0.2204	LR: 0.100000
Training Epoch: 5 [2816/45000]	Loss: 0.3559	LR: 0.100000
Training Epoch: 5 [3072/45000]	Loss: 0.3328	LR: 0.100000
Training Epoch: 5 [3328/45000]	Loss: 0.3094	LR: 0.100000
Training Epoch: 5 [3584/45000]	Loss: 0.3760	LR: 0.100000
Training Epoch: 5 [3840/45000]	Loss: 0.2372	LR: 0.100000
Training Epoch: 5 [4096/45000]	Loss: 0.3497	LR: 0.100000
Training Epoch: 5 [4352/45000]	Loss: 0.2601	LR: 0.100000
Training Epoch: 5 [4608/45000]	Loss: 0.2591	LR: 0.100000
Training Epoch: 5 [4864/45000]	Loss: 0.2803	LR: 0.100000
Training Epoch: 5 [5120/45000]	Loss: 0.3405	LR: 0.100000
Training Epoch: 5 [5376/45000]	Loss: 0.2409	LR: 0.100000
Training Epoch: 5 [5632/45000]	Loss: 0.3374	LR: 0.100000
Training Epoch: 5 [5888/45000]	Loss: 0.3546	LR: 0.100000
Training Epoch: 5 [6144/45000]	Loss: 0.3506	LR: 0.100000
Training Epoch: 5 [6400/45000]	Loss: 0.2284	LR: 0.100000
Training Epoch: 5 [6656/45000]	Loss: 0.3672	LR: 0.100000
Training Epoch: 5 [6912/45000]	Loss: 0.2662	LR: 0.100000
Training Epoch: 5 [7168/45000]	Loss: 0.3187	LR: 0.100000
Training Epoch: 5 [7424/45000]	Loss: 0.3725	LR: 0.100000
Training Epoch: 5 [7680/45000]	Loss: 0.2400	LR: 0.100000
Training Epoch: 5 [7936/45000]	Loss: 0.3668	LR: 0.100000
Training Epoch: 5 [8192/45000]	Loss: 0.3026	LR: 0.100000
Training Epoch: 5 [8448/45000]	Loss: 0.3184	LR: 0.100000
Training Epoch: 5 [8704/45000]	Loss: 0.2580	LR: 0.100000
Training Epoch: 5 [8960/45000]	Loss: 0.1901	LR: 0.100000
Training Epoch: 5 [9216/45000]	Loss: 0.3058	LR: 0.100000
Training Epoch: 5 [9472/45000]	Loss: 0.2609	LR: 0.100000
Training Epoch: 5 [9728/45000]	Loss: 0.2837	LR: 0.100000
Training Epoch: 5 [9984/45000]	Loss: 0.2181	LR: 0.100000
Training Epoch: 5 [10240/45000]	Loss: 0.3136	LR: 0.100000
Training Epoch: 5 [10496/45000]	Loss: 0.2678	LR: 0.100000
Training Epoch: 5 [10752/45000]	Loss: 0.2794	LR: 0.100000
Training Epoch: 5 [11008/45000]	Loss: 0.3275	LR: 0.100000
Training Epoch: 5 [11264/45000]	Loss: 0.2816	LR: 0.100000
Training Epoch: 5 [11520/45000]	Loss: 0.3345	LR: 0.100000
Training Epoch: 5 [11776/45000]	Loss: 0.3215	LR: 0.100000
Training Epoch: 5 [12032/45000]	Loss: 0.2334	LR: 0.100000
Training Epoch: 5 [12288/45000]	Loss: 0.3123	LR: 0.100000
Training Epoch: 5 [12544/45000]	Loss: 0.2843	LR: 0.100000
Training Epoch: 5 [12800/45000]	Loss: 0.2297	LR: 0.100000
Training Epoch: 5 [13056/45000]	Loss: 0.2012	LR: 0.100000
Training Epoch: 5 [13312/45000]	Loss: 0.2095	LR: 0.100000
Training Epoch: 5 [13568/45000]	Loss: 0.3484	LR: 0.100000
Training Epoch: 5 [13824/45000]	Loss: 0.2275	LR: 0.100000
Training Epoch: 5 [14080/45000]	Loss: 0.1578	LR: 0.100000
Training Epoch: 5 [14336/45000]	Loss: 0.3399	LR: 0.100000
Training Epoch: 5 [14592/45000]	Loss: 0.3046	LR: 0.100000
Training Epoch: 5 [14848/45000]	Loss: 0.3278	LR: 0.100000
Training Epoch: 5 [15104/45000]	Loss: 0.3837	LR: 0.100000
Training Epoch: 5 [15360/45000]	Loss: 0.3258	LR: 0.100000
Training Epoch: 5 [15616/45000]	Loss: 0.3183	LR: 0.100000
Training Epoch: 5 [15872/45000]	Loss: 0.2800	LR: 0.100000
Training Epoch: 5 [16128/45000]	Loss: 0.2920	LR: 0.100000
Training Epoch: 5 [16384/45000]	Loss: 0.2302	LR: 0.100000
Training Epoch: 5 [16640/45000]	Loss: 0.3514	LR: 0.100000
Training Epoch: 5 [16896/45000]	Loss: 0.3115	LR: 0.100000
Training Epoch: 5 [17152/45000]	Loss: 0.3066	LR: 0.100000
Training Epoch: 5 [17408/45000]	Loss: 0.2575	LR: 0.100000
Training Epoch: 5 [17664/45000]	Loss: 0.2168	LR: 0.100000
Training Epoch: 5 [17920/45000]	Loss: 0.1932	LR: 0.100000
Training Epoch: 5 [18176/45000]	Loss: 0.2655	LR: 0.100000
Training Epoch: 5 [18432/45000]	Loss: 0.2134	LR: 0.100000
Training Epoch: 5 [18688/45000]	Loss: 0.3564	LR: 0.100000
Training Epoch: 5 [18944/45000]	Loss: 0.2390	LR: 0.100000
Training Epoch: 5 [19200/45000]	Loss: 0.3025	LR: 0.100000
Training Epoch: 5 [19456/45000]	Loss: 0.5521	LR: 0.100000
Training Epoch: 5 [19712/45000]	Loss: 0.2008	LR: 0.100000
Training Epoch: 5 [19968/45000]	Loss: 0.1846	LR: 0.100000
Training Epoch: 5 [20224/45000]	Loss: 0.4666	LR: 0.100000
Training Epoch: 5 [20480/45000]	Loss: 0.2721	LR: 0.100000
Training Epoch: 5 [20736/45000]	Loss: 0.2618	LR: 0.100000
Training Epoch: 5 [20992/45000]	Loss: 0.3580	LR: 0.100000
Training Epoch: 5 [21248/45000]	Loss: 0.2287	LR: 0.100000
Training Epoch: 5 [21504/45000]	Loss: 0.3484	LR: 0.100000
Training Epoch: 5 [21760/45000]	Loss: 0.3341	LR: 0.100000
Training Epoch: 5 [22016/45000]	Loss: 0.2459	LR: 0.100000
Training Epoch: 5 [22272/45000]	Loss: 0.3097	LR: 0.100000
Training Epoch: 5 [22528/45000]	Loss: 0.2645	LR: 0.100000
Training Epoch: 5 [22784/45000]	Loss: 0.2628	LR: 0.100000
Training Epoch: 5 [23040/45000]	Loss: 0.3958	LR: 0.100000
Training Epoch: 5 [23296/45000]	Loss: 0.2852	LR: 0.100000
Training Epoch: 5 [23552/45000]	Loss: 0.1948	LR: 0.100000
Training Epoch: 5 [23808/45000]	Loss: 0.1685	LR: 0.100000
Training Epoch: 5 [24064/45000]	Loss: 0.3249	LR: 0.100000
Training Epoch: 5 [24320/45000]	Loss: 0.2519	LR: 0.100000
Training Epoch: 5 [24576/45000]	Loss: 0.2738	LR: 0.100000
Training Epoch: 5 [24832/45000]	Loss: 0.2741	LR: 0.100000
Training Epoch: 5 [25088/45000]	Loss: 0.2598	LR: 0.100000
Training Epoch: 5 [25344/45000]	Loss: 0.2140	LR: 0.100000
Training Epoch: 5 [25600/45000]	Loss: 0.1651	LR: 0.100000
Training Epoch: 5 [25856/45000]	Loss: 0.2532	LR: 0.100000
Training Epoch: 5 [26112/45000]	Loss: 0.2850	LR: 0.100000
Training Epoch: 5 [26368/45000]	Loss: 0.3085	LR: 0.100000
Training Epoch: 5 [26624/45000]	Loss: 0.2393	LR: 0.100000
Training Epoch: 5 [26880/45000]	Loss: 0.3000	LR: 0.100000
Training Epoch: 5 [27136/45000]	Loss: 0.2957	LR: 0.100000
Training Epoch: 5 [27392/45000]	Loss: 0.3119	LR: 0.100000
Training Epoch: 5 [27648/45000]	Loss: 0.3860	LR: 0.100000
Training Epoch: 5 [27904/45000]	Loss: 0.3427	LR: 0.100000
Training Epoch: 5 [28160/45000]	Loss: 0.3043	LR: 0.100000
Training Epoch: 5 [28416/45000]	Loss: 0.2637	LR: 0.100000
Training Epoch: 5 [28672/45000]	Loss: 0.2455	LR: 0.100000
Training Epoch: 5 [28928/45000]	Loss: 0.2197	LR: 0.100000
Training Epoch: 5 [29184/45000]	Loss: 0.2447	LR: 0.100000
Training Epoch: 5 [29440/45000]	Loss: 0.2609	LR: 0.100000
Training Epoch: 5 [29696/45000]	Loss: 0.1710	LR: 0.100000
Training Epoch: 5 [29952/45000]	Loss: 0.2333	LR: 0.100000
Training Epoch: 5 [30208/45000]	Loss: 0.2465	LR: 0.100000
Training Epoch: 5 [30464/45000]	Loss: 0.1351	LR: 0.100000
Training Epoch: 5 [30720/45000]	Loss: 0.2453	LR: 0.100000
Training Epoch: 5 [30976/45000]	Loss: 0.2977	LR: 0.100000
Training Epoch: 5 [31232/45000]	Loss: 0.1902	LR: 0.100000
Training Epoch: 5 [31488/45000]	Loss: 0.2868	LR: 0.100000
Training Epoch: 5 [31744/45000]	Loss: 0.2619	LR: 0.100000
Training Epoch: 5 [32000/45000]	Loss: 0.3164	LR: 0.100000
Training Epoch: 5 [32256/45000]	Loss: 0.2998	LR: 0.100000
Training Epoch: 5 [32512/45000]	Loss: 0.2829	LR: 0.100000
Training Epoch: 5 [32768/45000]	Loss: 0.2436	LR: 0.100000
Training Epoch: 5 [33024/45000]	Loss: 0.1885	LR: 0.100000
Training Epoch: 5 [33280/45000]	Loss: 0.2791	LR: 0.100000
Training Epoch: 5 [33536/45000]	Loss: 0.2114	LR: 0.100000
Training Epoch: 5 [33792/45000]	Loss: 0.2605	LR: 0.100000
Training Epoch: 5 [34048/45000]	Loss: 0.2259	LR: 0.100000
Training Epoch: 5 [34304/45000]	Loss: 0.3452	LR: 0.100000
Training Epoch: 5 [34560/45000]	Loss: 0.2493	LR: 0.100000
Training Epoch: 5 [34816/45000]	Loss: 0.2700	LR: 0.100000
Training Epoch: 5 [35072/45000]	Loss: 0.2305	LR: 0.100000
Training Epoch: 5 [35328/45000]	Loss: 0.3741	LR: 0.100000
Training Epoch: 5 [35584/45000]	Loss: 0.3382	LR: 0.100000
Training Epoch: 5 [35840/45000]	Loss: 0.3050	LR: 0.100000
Training Epoch: 5 [36096/45000]	Loss: 0.2537	LR: 0.100000
Training Epoch: 5 [36352/45000]	Loss: 0.2270	LR: 0.100000
Training Epoch: 5 [36608/45000]	Loss: 0.3538	LR: 0.100000
Training Epoch: 5 [36864/45000]	Loss: 0.2663	LR: 0.100000
Training Epoch: 5 [37120/45000]	Loss: 0.2684	LR: 0.100000
Training Epoch: 5 [37376/45000]	Loss: 0.2582	LR: 0.100000
Training Epoch: 5 [37632/45000]	Loss: 0.2730	LR: 0.100000
Training Epoch: 5 [37888/45000]	Loss: 0.3495	LR: 0.100000
Training Epoch: 5 [38144/45000]	Loss: 0.2040	LR: 0.100000
Training Epoch: 5 [38400/45000]	Loss: 0.2483	LR: 0.100000
Training Epoch: 5 [38656/45000]	Loss: 0.3662	LR: 0.100000
Training Epoch: 5 [38912/45000]	Loss: 0.2009	LR: 0.100000
Training Epoch: 5 [39168/45000]	Loss: 0.3097	LR: 0.100000
Training Epoch: 5 [39424/45000]	Loss: 0.3251	LR: 0.100000
Training Epoch: 5 [39680/45000]	Loss: 0.3088	LR: 0.100000
Training Epoch: 5 [39936/45000]	Loss: 0.2553	LR: 0.100000
Training Epoch: 5 [40192/45000]	Loss: 0.2428	LR: 0.100000
Training Epoch: 5 [40448/45000]	Loss: 0.2931	LR: 0.100000
Training Epoch: 5 [40704/45000]	Loss: 0.3501	LR: 0.100000
Training Epoch: 5 [40960/45000]	Loss: 0.2456	LR: 0.100000
Training Epoch: 5 [41216/45000]	Loss: 0.2343	LR: 0.100000
Training Epoch: 5 [41472/45000]	Loss: 0.2332	LR: 0.100000
Training Epoch: 5 [41728/45000]	Loss: 0.2642	LR: 0.100000
Training Epoch: 5 [41984/45000]	Loss: 0.2896	LR: 0.100000
Training Epoch: 5 [42240/45000]	Loss: 0.3243	LR: 0.100000
Training Epoch: 5 [42496/45000]	Loss: 0.1921	LR: 0.100000
Training Epoch: 5 [42752/45000]	Loss: 0.2317	LR: 0.100000
Training Epoch: 5 [43008/45000]	Loss: 0.1997	LR: 0.100000
Training Epoch: 5 [43264/45000]	Loss: 0.3191	LR: 0.100000
Training Epoch: 5 [43520/45000]	Loss: 0.2330	LR: 0.100000
Training Epoch: 5 [43776/45000]	Loss: 0.1648	LR: 0.100000
Training Epoch: 5 [44032/45000]	Loss: 0.2064	LR: 0.100000
Training Epoch: 5 [44288/45000]	Loss: 0.2011	LR: 0.100000
Training Epoch: 5 [44544/45000]	Loss: 0.2795	LR: 0.100000
Training Epoch: 5 [44800/45000]	Loss: 0.2687	LR: 0.100000
Training Epoch: 5 [45000/45000]	Loss: 0.2560	LR: 0.100000
Epoch 5 - Average Train Loss: 0.2781, Train Accuracy: 0.9041
Epoch 5 training time consumed: 324.34s
Evaluating Network.....
Test set: Epoch: 5, Average loss: 0.0008, Accuracy: 0.9304, Time consumed:23.39s
Saving weights file to checkpoint/retrain/ViT/Thursday_17_July_2025_00h_03m_36s/ViT-Cifar10-seed1-ret100-5-best.pth
Training Epoch: 6 [256/45000]	Loss: 0.1913	LR: 0.100000
Training Epoch: 6 [512/45000]	Loss: 0.2550	LR: 0.100000
Training Epoch: 6 [768/45000]	Loss: 0.1720	LR: 0.100000
Training Epoch: 6 [1024/45000]	Loss: 0.2269	LR: 0.100000
Training Epoch: 6 [1280/45000]	Loss: 0.1701	LR: 0.100000
Training Epoch: 6 [1536/45000]	Loss: 0.1731	LR: 0.100000
Training Epoch: 6 [1792/45000]	Loss: 0.2423	LR: 0.100000
Training Epoch: 6 [2048/45000]	Loss: 0.2055	LR: 0.100000
Training Epoch: 6 [2304/45000]	Loss: 0.1441	LR: 0.100000
Training Epoch: 6 [2560/45000]	Loss: 0.1946	LR: 0.100000
Training Epoch: 6 [2816/45000]	Loss: 0.1519	LR: 0.100000
Training Epoch: 6 [3072/45000]	Loss: 0.2228	LR: 0.100000
Training Epoch: 6 [3328/45000]	Loss: 0.2049	LR: 0.100000
Training Epoch: 6 [3584/45000]	Loss: 0.2869	LR: 0.100000
Training Epoch: 6 [3840/45000]	Loss: 0.2801	LR: 0.100000
Training Epoch: 6 [4096/45000]	Loss: 0.1919	LR: 0.100000
Training Epoch: 6 [4352/45000]	Loss: 0.3531	LR: 0.100000
Training Epoch: 6 [4608/45000]	Loss: 0.2474	LR: 0.100000
Training Epoch: 6 [4864/45000]	Loss: 0.2836	LR: 0.100000
Training Epoch: 6 [5120/45000]	Loss: 0.3388	LR: 0.100000
Training Epoch: 6 [5376/45000]	Loss: 0.2921	LR: 0.100000
Training Epoch: 6 [5632/45000]	Loss: 0.2397	LR: 0.100000
Training Epoch: 6 [5888/45000]	Loss: 0.1533	LR: 0.100000
Training Epoch: 6 [6144/45000]	Loss: 0.3520	LR: 0.100000
Training Epoch: 6 [6400/45000]	Loss: 0.1961	LR: 0.100000
Training Epoch: 6 [6656/45000]	Loss: 0.2921	LR: 0.100000
Training Epoch: 6 [6912/45000]	Loss: 0.1965	LR: 0.100000
Training Epoch: 6 [7168/45000]	Loss: 0.1845	LR: 0.100000
Training Epoch: 6 [7424/45000]	Loss: 0.1996	LR: 0.100000
Training Epoch: 6 [7680/45000]	Loss: 0.2050	LR: 0.100000
Training Epoch: 6 [7936/45000]	Loss: 0.2152	LR: 0.100000
Training Epoch: 6 [8192/45000]	Loss: 0.2572	LR: 0.100000
Training Epoch: 6 [8448/45000]	Loss: 0.2734	LR: 0.100000
Training Epoch: 6 [8704/45000]	Loss: 0.1061	LR: 0.100000
Training Epoch: 6 [8960/45000]	Loss: 0.2444	LR: 0.100000
Training Epoch: 6 [9216/45000]	Loss: 0.1672	LR: 0.100000
Training Epoch: 6 [9472/45000]	Loss: 0.1898	LR: 0.100000
Training Epoch: 6 [9728/45000]	Loss: 0.1150	LR: 0.100000
Training Epoch: 6 [9984/45000]	Loss: 0.2366	LR: 0.100000
Training Epoch: 6 [10240/45000]	Loss: 0.1987	LR: 0.100000
Training Epoch: 6 [10496/45000]	Loss: 0.1139	LR: 0.100000
Training Epoch: 6 [10752/45000]	Loss: 0.2058	LR: 0.100000
Training Epoch: 6 [11008/45000]	Loss: 0.3030	LR: 0.100000
Training Epoch: 6 [11264/45000]	Loss: 0.2183	LR: 0.100000
Training Epoch: 6 [11520/45000]	Loss: 0.3267	LR: 0.100000
Training Epoch: 6 [11776/45000]	Loss: 0.1655	LR: 0.100000
Training Epoch: 6 [12032/45000]	Loss: 0.1545	LR: 0.100000
Training Epoch: 6 [12288/45000]	Loss: 0.3328	LR: 0.100000
Training Epoch: 6 [12544/45000]	Loss: 0.2490	LR: 0.100000
Training Epoch: 6 [12800/45000]	Loss: 0.2427	LR: 0.100000
Training Epoch: 6 [13056/45000]	Loss: 0.4003	LR: 0.100000
Training Epoch: 6 [13312/45000]	Loss: 0.2606	LR: 0.100000
Training Epoch: 6 [13568/45000]	Loss: 0.2131	LR: 0.100000
Training Epoch: 6 [13824/45000]	Loss: 0.3402	LR: 0.100000
Training Epoch: 6 [14080/45000]	Loss: 0.2457	LR: 0.100000
Training Epoch: 6 [14336/45000]	Loss: 0.2551	LR: 0.100000
Training Epoch: 6 [14592/45000]	Loss: 0.2717	LR: 0.100000
Training Epoch: 6 [14848/45000]	Loss: 0.2255	LR: 0.100000
Training Epoch: 6 [15104/45000]	Loss: 0.2508	LR: 0.100000
Training Epoch: 6 [15360/45000]	Loss: 0.2834	LR: 0.100000
Training Epoch: 6 [15616/45000]	Loss: 0.1827	LR: 0.100000
Training Epoch: 6 [15872/45000]	Loss: 0.2605	LR: 0.100000
Training Epoch: 6 [16128/45000]	Loss: 0.1716	LR: 0.100000
Training Epoch: 6 [16384/45000]	Loss: 0.1711	LR: 0.100000
Training Epoch: 6 [16640/45000]	Loss: 0.2259	LR: 0.100000
Training Epoch: 6 [16896/45000]	Loss: 0.1806	LR: 0.100000
Training Epoch: 6 [17152/45000]	Loss: 0.2585	LR: 0.100000
Training Epoch: 6 [17408/45000]	Loss: 0.1988	LR: 0.100000
Training Epoch: 6 [17664/45000]	Loss: 0.3160	LR: 0.100000
Training Epoch: 6 [17920/45000]	Loss: 0.2341	LR: 0.100000
Training Epoch: 6 [18176/45000]	Loss: 0.1804	LR: 0.100000
Training Epoch: 6 [18432/45000]	Loss: 0.1687	LR: 0.100000
Training Epoch: 6 [18688/45000]	Loss: 0.2794	LR: 0.100000
Training Epoch: 6 [18944/45000]	Loss: 0.2116	LR: 0.100000
Training Epoch: 6 [19200/45000]	Loss: 0.1983	LR: 0.100000
Training Epoch: 6 [19456/45000]	Loss: 0.1996	LR: 0.100000
Training Epoch: 6 [19712/45000]	Loss: 0.1648	LR: 0.100000
Training Epoch: 6 [19968/45000]	Loss: 0.2036	LR: 0.100000
Training Epoch: 6 [20224/45000]	Loss: 0.1821	LR: 0.100000
Training Epoch: 6 [20480/45000]	Loss: 0.2593	LR: 0.100000
Training Epoch: 6 [20736/45000]	Loss: 0.2198	LR: 0.100000
Training Epoch: 6 [20992/45000]	Loss: 0.2104	LR: 0.100000
Training Epoch: 6 [21248/45000]	Loss: 0.2652	LR: 0.100000
Training Epoch: 6 [21504/45000]	Loss: 0.2929	LR: 0.100000
Training Epoch: 6 [21760/45000]	Loss: 0.2881	LR: 0.100000
Training Epoch: 6 [22016/45000]	Loss: 0.2842	LR: 0.100000
Training Epoch: 6 [22272/45000]	Loss: 0.2691	LR: 0.100000
Training Epoch: 6 [22528/45000]	Loss: 0.2267	LR: 0.100000
Training Epoch: 6 [22784/45000]	Loss: 0.2344	LR: 0.100000
Training Epoch: 6 [23040/45000]	Loss: 0.3208	LR: 0.100000
Training Epoch: 6 [23296/45000]	Loss: 0.2420	LR: 0.100000
Training Epoch: 6 [23552/45000]	Loss: 0.2477	LR: 0.100000
Training Epoch: 6 [23808/45000]	Loss: 0.2273	LR: 0.100000
Training Epoch: 6 [24064/45000]	Loss: 0.2274	LR: 0.100000
Training Epoch: 6 [24320/45000]	Loss: 0.2308	LR: 0.100000
Training Epoch: 6 [24576/45000]	Loss: 0.2927	LR: 0.100000
Training Epoch: 6 [24832/45000]	Loss: 0.2088	LR: 0.100000
Training Epoch: 6 [25088/45000]	Loss: 0.2606	LR: 0.100000
Training Epoch: 6 [25344/45000]	Loss: 0.1875	LR: 0.100000
Training Epoch: 6 [25600/45000]	Loss: 0.2595	LR: 0.100000
Training Epoch: 6 [25856/45000]	Loss: 0.1938	LR: 0.100000
Training Epoch: 6 [26112/45000]	Loss: 0.2338	LR: 0.100000
Training Epoch: 6 [26368/45000]	Loss: 0.3379	LR: 0.100000
Training Epoch: 6 [26624/45000]	Loss: 0.2123	LR: 0.100000
Training Epoch: 6 [26880/45000]	Loss: 0.2185	LR: 0.100000
Training Epoch: 6 [27136/45000]	Loss: 0.2226	LR: 0.100000
Training Epoch: 6 [27392/45000]	Loss: 0.2276	LR: 0.100000
Training Epoch: 6 [27648/45000]	Loss: 0.2411	LR: 0.100000
Training Epoch: 6 [27904/45000]	Loss: 0.2680	LR: 0.100000
Training Epoch: 6 [28160/45000]	Loss: 0.2592	LR: 0.100000
Training Epoch: 6 [28416/45000]	Loss: 0.2229	LR: 0.100000
Training Epoch: 6 [28672/45000]	Loss: 0.1874	LR: 0.100000
Training Epoch: 6 [28928/45000]	Loss: 0.2452	LR: 0.100000
Training Epoch: 6 [29184/45000]	Loss: 0.2128	LR: 0.100000
Training Epoch: 6 [29440/45000]	Loss: 0.2800	LR: 0.100000
Training Epoch: 6 [29696/45000]	Loss: 0.2358	LR: 0.100000
Training Epoch: 6 [29952/45000]	Loss: 0.2012	LR: 0.100000
Training Epoch: 6 [30208/45000]	Loss: 0.2530	LR: 0.100000
Training Epoch: 6 [30464/45000]	Loss: 0.1560	LR: 0.100000
Training Epoch: 6 [30720/45000]	Loss: 0.3918	LR: 0.100000
Training Epoch: 6 [30976/45000]	Loss: 0.2068	LR: 0.100000
Training Epoch: 6 [31232/45000]	Loss: 0.2066	LR: 0.100000
Training Epoch: 6 [31488/45000]	Loss: 0.2387	LR: 0.100000
Training Epoch: 6 [31744/45000]	Loss: 0.1568	LR: 0.100000
Training Epoch: 6 [32000/45000]	Loss: 0.2201	LR: 0.100000
Training Epoch: 6 [32256/45000]	Loss: 0.2847	LR: 0.100000
Training Epoch: 6 [32512/45000]	Loss: 0.2645	LR: 0.100000
Training Epoch: 6 [32768/45000]	Loss: 0.2210	LR: 0.100000
Training Epoch: 6 [33024/45000]	Loss: 0.1988	LR: 0.100000
Training Epoch: 6 [33280/45000]	Loss: 0.3188	LR: 0.100000
Training Epoch: 6 [33536/45000]	Loss: 0.2401	LR: 0.100000
Training Epoch: 6 [33792/45000]	Loss: 0.3883	LR: 0.100000
Training Epoch: 6 [34048/45000]	Loss: 0.2939	LR: 0.100000
Training Epoch: 6 [34304/45000]	Loss: 0.3129	LR: 0.100000
Training Epoch: 6 [34560/45000]	Loss: 0.3614	LR: 0.100000
Training Epoch: 6 [34816/45000]	Loss: 0.2794	LR: 0.100000
Training Epoch: 6 [35072/45000]	Loss: 0.2843	LR: 0.100000
Training Epoch: 6 [35328/45000]	Loss: 0.2759	LR: 0.100000
Training Epoch: 6 [35584/45000]	Loss: 0.2426	LR: 0.100000
Training Epoch: 6 [35840/45000]	Loss: 0.1871	LR: 0.100000
Training Epoch: 6 [36096/45000]	Loss: 0.1631	LR: 0.100000
Training Epoch: 6 [36352/45000]	Loss: 0.1750	LR: 0.100000
Training Epoch: 6 [36608/45000]	Loss: 0.2283	LR: 0.100000
Training Epoch: 6 [36864/45000]	Loss: 0.2419	LR: 0.100000
Training Epoch: 6 [37120/45000]	Loss: 0.2246	LR: 0.100000
Training Epoch: 6 [37376/45000]	Loss: 0.2116	LR: 0.100000
Training Epoch: 6 [37632/45000]	Loss: 0.2528	LR: 0.100000
Training Epoch: 6 [37888/45000]	Loss: 0.2526	LR: 0.100000
Training Epoch: 6 [38144/45000]	Loss: 0.2088	LR: 0.100000
Training Epoch: 6 [38400/45000]	Loss: 0.2159	LR: 0.100000
Training Epoch: 6 [38656/45000]	Loss: 0.2113	LR: 0.100000
Training Epoch: 6 [38912/45000]	Loss: 0.2927	LR: 0.100000
Training Epoch: 6 [39168/45000]	Loss: 0.2603	LR: 0.100000
Training Epoch: 6 [39424/45000]	Loss: 0.2792	LR: 0.100000
Training Epoch: 6 [39680/45000]	Loss: 0.2108	LR: 0.100000
Training Epoch: 6 [39936/45000]	Loss: 0.1833	LR: 0.100000
Training Epoch: 6 [40192/45000]	Loss: 0.2566	LR: 0.100000
Training Epoch: 6 [40448/45000]	Loss: 0.1687	LR: 0.100000
Training Epoch: 6 [40704/45000]	Loss: 0.2939	LR: 0.100000
Training Epoch: 6 [40960/45000]	Loss: 0.1533	LR: 0.100000
Training Epoch: 6 [41216/45000]	Loss: 0.2491	LR: 0.100000
Training Epoch: 6 [41472/45000]	Loss: 0.3752	LR: 0.100000
Training Epoch: 6 [41728/45000]	Loss: 0.2635	LR: 0.100000
Training Epoch: 6 [41984/45000]	Loss: 0.2208	LR: 0.100000
Training Epoch: 6 [42240/45000]	Loss: 0.2674	LR: 0.100000
Training Epoch: 6 [42496/45000]	Loss: 0.1964	LR: 0.100000
Training Epoch: 6 [42752/45000]	Loss: 0.2734	LR: 0.100000
Training Epoch: 6 [43008/45000]	Loss: 0.3482	LR: 0.100000
Training Epoch: 6 [43264/45000]	Loss: 0.1803	LR: 0.100000
Training Epoch: 6 [43520/45000]	Loss: 0.2331	LR: 0.100000
Training Epoch: 6 [43776/45000]	Loss: 0.2404	LR: 0.100000
Training Epoch: 6 [44032/45000]	Loss: 0.3748	LR: 0.100000
Training Epoch: 6 [44288/45000]	Loss: 0.1718	LR: 0.100000
Training Epoch: 6 [44544/45000]	Loss: 0.2420	LR: 0.100000
Training Epoch: 6 [44800/45000]	Loss: 0.1956	LR: 0.100000
Training Epoch: 6 [45000/45000]	Loss: 0.2192	LR: 0.100000
Epoch 6 - Average Train Loss: 0.2368, Train Accuracy: 0.9173
Epoch 6 training time consumed: 323.88s
Evaluating Network.....
Test set: Epoch: 6, Average loss: 0.0008, Accuracy: 0.9325, Time consumed:23.43s
Saving weights file to checkpoint/retrain/ViT/Thursday_17_July_2025_00h_03m_36s/ViT-Cifar10-seed1-ret100-6-best.pth
Training Epoch: 7 [256/45000]	Loss: 0.1805	LR: 0.020000
Training Epoch: 7 [512/45000]	Loss: 0.2548	LR: 0.020000
Training Epoch: 7 [768/45000]	Loss: 0.1793	LR: 0.020000
Training Epoch: 7 [1024/45000]	Loss: 0.1347	LR: 0.020000
Training Epoch: 7 [1280/45000]	Loss: 0.1473	LR: 0.020000
Training Epoch: 7 [1536/45000]	Loss: 0.0893	LR: 0.020000
Training Epoch: 7 [1792/45000]	Loss: 0.0672	LR: 0.020000
Training Epoch: 7 [2048/45000]	Loss: 0.1327	LR: 0.020000
Training Epoch: 7 [2304/45000]	Loss: 0.1521	LR: 0.020000
Training Epoch: 7 [2560/45000]	Loss: 0.1085	LR: 0.020000
Training Epoch: 7 [2816/45000]	Loss: 0.0823	LR: 0.020000
Training Epoch: 7 [3072/45000]	Loss: 0.1244	LR: 0.020000
Training Epoch: 7 [3328/45000]	Loss: 0.1446	LR: 0.020000
Training Epoch: 7 [3584/45000]	Loss: 0.1176	LR: 0.020000
Training Epoch: 7 [3840/45000]	Loss: 0.0992	LR: 0.020000
Training Epoch: 7 [4096/45000]	Loss: 0.1035	LR: 0.020000
Training Epoch: 7 [4352/45000]	Loss: 0.0972	LR: 0.020000
Training Epoch: 7 [4608/45000]	Loss: 0.1188	LR: 0.020000
Training Epoch: 7 [4864/45000]	Loss: 0.1409	LR: 0.020000
Training Epoch: 7 [5120/45000]	Loss: 0.0784	LR: 0.020000
Training Epoch: 7 [5376/45000]	Loss: 0.1202	LR: 0.020000
Training Epoch: 7 [5632/45000]	Loss: 0.1085	LR: 0.020000
Training Epoch: 7 [5888/45000]	Loss: 0.1248	LR: 0.020000
Training Epoch: 7 [6144/45000]	Loss: 0.0554	LR: 0.020000
Training Epoch: 7 [6400/45000]	Loss: 0.1117	LR: 0.020000
Training Epoch: 7 [6656/45000]	Loss: 0.1143	LR: 0.020000
Training Epoch: 7 [6912/45000]	Loss: 0.1046	LR: 0.020000
Training Epoch: 7 [7168/45000]	Loss: 0.1031	LR: 0.020000
Training Epoch: 7 [7424/45000]	Loss: 0.1153	LR: 0.020000
Training Epoch: 7 [7680/45000]	Loss: 0.0788	LR: 0.020000
Training Epoch: 7 [7936/45000]	Loss: 0.1122	LR: 0.020000
Training Epoch: 7 [8192/45000]	Loss: 0.0608	LR: 0.020000
Training Epoch: 7 [8448/45000]	Loss: 0.0772	LR: 0.020000
Training Epoch: 7 [8704/45000]	Loss: 0.1438	LR: 0.020000
Training Epoch: 7 [8960/45000]	Loss: 0.0589	LR: 0.020000
Training Epoch: 7 [9216/45000]	Loss: 0.1002	LR: 0.020000
Training Epoch: 7 [9472/45000]	Loss: 0.0500	LR: 0.020000
Training Epoch: 7 [9728/45000]	Loss: 0.0801	LR: 0.020000
Training Epoch: 7 [9984/45000]	Loss: 0.1015	LR: 0.020000
Training Epoch: 7 [10240/45000]	Loss: 0.1159	LR: 0.020000
Training Epoch: 7 [10496/45000]	Loss: 0.1317	LR: 0.020000
Training Epoch: 7 [10752/45000]	Loss: 0.0394	LR: 0.020000
Training Epoch: 7 [11008/45000]	Loss: 0.1297	LR: 0.020000
Training Epoch: 7 [11264/45000]	Loss: 0.1065	LR: 0.020000
Training Epoch: 7 [11520/45000]	Loss: 0.0944	LR: 0.020000
Training Epoch: 7 [11776/45000]	Loss: 0.0936	LR: 0.020000
Training Epoch: 7 [12032/45000]	Loss: 0.1101	LR: 0.020000
Training Epoch: 7 [12288/45000]	Loss: 0.0545	LR: 0.020000
Training Epoch: 7 [12544/45000]	Loss: 0.0655	LR: 0.020000
Training Epoch: 7 [12800/45000]	Loss: 0.1110	LR: 0.020000
Training Epoch: 7 [13056/45000]	Loss: 0.0822	LR: 0.020000
Training Epoch: 7 [13312/45000]	Loss: 0.0640	LR: 0.020000
Training Epoch: 7 [13568/45000]	Loss: 0.0581	LR: 0.020000
Training Epoch: 7 [13824/45000]	Loss: 0.0957	LR: 0.020000
Training Epoch: 7 [14080/45000]	Loss: 0.1432	LR: 0.020000
Training Epoch: 7 [14336/45000]	Loss: 0.0962	LR: 0.020000
Training Epoch: 7 [14592/45000]	Loss: 0.0810	LR: 0.020000
Training Epoch: 7 [14848/45000]	Loss: 0.0405	LR: 0.020000
Training Epoch: 7 [15104/45000]	Loss: 0.0554	LR: 0.020000
Training Epoch: 7 [15360/45000]	Loss: 0.0945	LR: 0.020000
Training Epoch: 7 [15616/45000]	Loss: 0.0789	LR: 0.020000
Training Epoch: 7 [15872/45000]	Loss: 0.0552	LR: 0.020000
Training Epoch: 7 [16128/45000]	Loss: 0.0854	LR: 0.020000
Training Epoch: 7 [16384/45000]	Loss: 0.0664	LR: 0.020000
Training Epoch: 7 [16640/45000]	Loss: 0.1201	LR: 0.020000
Training Epoch: 7 [16896/45000]	Loss: 0.0367	LR: 0.020000
Training Epoch: 7 [17152/45000]	Loss: 0.0968	LR: 0.020000
Training Epoch: 7 [17408/45000]	Loss: 0.0934	LR: 0.020000
Training Epoch: 7 [17664/45000]	Loss: 0.0628	LR: 0.020000
Training Epoch: 7 [17920/45000]	Loss: 0.0386	LR: 0.020000
Training Epoch: 7 [18176/45000]	Loss: 0.0683	LR: 0.020000
Training Epoch: 7 [18432/45000]	Loss: 0.1064	LR: 0.020000
Training Epoch: 7 [18688/45000]	Loss: 0.1172	LR: 0.020000
Training Epoch: 7 [18944/45000]	Loss: 0.0803	LR: 0.020000
Training Epoch: 7 [19200/45000]	Loss: 0.0809	LR: 0.020000
Training Epoch: 7 [19456/45000]	Loss: 0.0767	LR: 0.020000
Training Epoch: 7 [19712/45000]	Loss: 0.0759	LR: 0.020000
Training Epoch: 7 [19968/45000]	Loss: 0.1246	LR: 0.020000
Training Epoch: 7 [20224/45000]	Loss: 0.1107	LR: 0.020000
Training Epoch: 7 [20480/45000]	Loss: 0.1196	LR: 0.020000
Training Epoch: 7 [20736/45000]	Loss: 0.0765	LR: 0.020000
Training Epoch: 7 [20992/45000]	Loss: 0.0975	LR: 0.020000
Training Epoch: 7 [21248/45000]	Loss: 0.0972	LR: 0.020000
Training Epoch: 7 [21504/45000]	Loss: 0.1509	LR: 0.020000
Training Epoch: 7 [21760/45000]	Loss: 0.0974	LR: 0.020000
Training Epoch: 7 [22016/45000]	Loss: 0.1095	LR: 0.020000
Training Epoch: 7 [22272/45000]	Loss: 0.0991	LR: 0.020000
Training Epoch: 7 [22528/45000]	Loss: 0.0629	LR: 0.020000
Training Epoch: 7 [22784/45000]	Loss: 0.0609	LR: 0.020000
Training Epoch: 7 [23040/45000]	Loss: 0.1703	LR: 0.020000
Training Epoch: 7 [23296/45000]	Loss: 0.1255	LR: 0.020000
Training Epoch: 7 [23552/45000]	Loss: 0.1066	LR: 0.020000
Training Epoch: 7 [23808/45000]	Loss: 0.1372	LR: 0.020000
Training Epoch: 7 [24064/45000]	Loss: 0.0799	LR: 0.020000
Training Epoch: 7 [24320/45000]	Loss: 0.1065	LR: 0.020000
Training Epoch: 7 [24576/45000]	Loss: 0.0507	LR: 0.020000
Training Epoch: 7 [24832/45000]	Loss: 0.0427	LR: 0.020000
Training Epoch: 7 [25088/45000]	Loss: 0.1100	LR: 0.020000
Training Epoch: 7 [25344/45000]	Loss: 0.0955	LR: 0.020000
Training Epoch: 7 [25600/45000]	Loss: 0.0539	LR: 0.020000
Training Epoch: 7 [25856/45000]	Loss: 0.0505	LR: 0.020000
Training Epoch: 7 [26112/45000]	Loss: 0.0926	LR: 0.020000
Training Epoch: 7 [26368/45000]	Loss: 0.1190	LR: 0.020000
Training Epoch: 7 [26624/45000]	Loss: 0.0968	LR: 0.020000
Training Epoch: 7 [26880/45000]	Loss: 0.1055	LR: 0.020000
Training Epoch: 7 [27136/45000]	Loss: 0.0621	LR: 0.020000
Training Epoch: 7 [27392/45000]	Loss: 0.0854	LR: 0.020000
Training Epoch: 7 [27648/45000]	Loss: 0.0626	LR: 0.020000
Training Epoch: 7 [27904/45000]	Loss: 0.0671	LR: 0.020000
Training Epoch: 7 [28160/45000]	Loss: 0.0723	LR: 0.020000
Training Epoch: 7 [28416/45000]	Loss: 0.0899	LR: 0.020000
Training Epoch: 7 [28672/45000]	Loss: 0.0985	LR: 0.020000
Training Epoch: 7 [28928/45000]	Loss: 0.0674	LR: 0.020000
Training Epoch: 7 [29184/45000]	Loss: 0.0939	LR: 0.020000
Training Epoch: 7 [29440/45000]	Loss: 0.0891	LR: 0.020000
Training Epoch: 7 [29696/45000]	Loss: 0.0821	LR: 0.020000
Training Epoch: 7 [29952/45000]	Loss: 0.0922	LR: 0.020000
Training Epoch: 7 [30208/45000]	Loss: 0.0817	LR: 0.020000
Training Epoch: 7 [30464/45000]	Loss: 0.0643	LR: 0.020000
Training Epoch: 7 [30720/45000]	Loss: 0.1365	LR: 0.020000
Training Epoch: 7 [30976/45000]	Loss: 0.0527	LR: 0.020000
Training Epoch: 7 [31232/45000]	Loss: 0.0879	LR: 0.020000
Training Epoch: 7 [31488/45000]	Loss: 0.1167	LR: 0.020000
Training Epoch: 7 [31744/45000]	Loss: 0.1195	LR: 0.020000
Training Epoch: 7 [32000/45000]	Loss: 0.0543	LR: 0.020000
Training Epoch: 7 [32256/45000]	Loss: 0.1021	LR: 0.020000
Training Epoch: 7 [32512/45000]	Loss: 0.0542	LR: 0.020000
Training Epoch: 7 [32768/45000]	Loss: 0.0783	LR: 0.020000
Training Epoch: 7 [33024/45000]	Loss: 0.0523	LR: 0.020000
Training Epoch: 7 [33280/45000]	Loss: 0.0600	LR: 0.020000
Training Epoch: 7 [33536/45000]	Loss: 0.1334	LR: 0.020000
Training Epoch: 7 [33792/45000]	Loss: 0.1057	LR: 0.020000
Training Epoch: 7 [34048/45000]	Loss: 0.0487	LR: 0.020000
Training Epoch: 7 [34304/45000]	Loss: 0.0377	LR: 0.020000
Training Epoch: 7 [34560/45000]	Loss: 0.0817	LR: 0.020000
Training Epoch: 7 [34816/45000]	Loss: 0.0665	LR: 0.020000
Training Epoch: 7 [35072/45000]	Loss: 0.0791	LR: 0.020000
Training Epoch: 7 [35328/45000]	Loss: 0.0947	LR: 0.020000
Training Epoch: 7 [35584/45000]	Loss: 0.1001	LR: 0.020000
Training Epoch: 7 [35840/45000]	Loss: 0.1170	LR: 0.020000
Training Epoch: 7 [36096/45000]	Loss: 0.1238	LR: 0.020000
Training Epoch: 7 [36352/45000]	Loss: 0.0897	LR: 0.020000
Training Epoch: 7 [36608/45000]	Loss: 0.0616	LR: 0.020000
Training Epoch: 7 [36864/45000]	Loss: 0.0552	LR: 0.020000
Training Epoch: 7 [37120/45000]	Loss: 0.0876	LR: 0.020000
Training Epoch: 7 [37376/45000]	Loss: 0.0655	LR: 0.020000
Training Epoch: 7 [37632/45000]	Loss: 0.0621	LR: 0.020000
Training Epoch: 7 [37888/45000]	Loss: 0.0905	LR: 0.020000
Training Epoch: 7 [38144/45000]	Loss: 0.0695	LR: 0.020000
Training Epoch: 7 [38400/45000]	Loss: 0.0597	LR: 0.020000
Training Epoch: 7 [38656/45000]	Loss: 0.1209	LR: 0.020000
Training Epoch: 7 [38912/45000]	Loss: 0.0753	LR: 0.020000
Training Epoch: 7 [39168/45000]	Loss: 0.0729	LR: 0.020000
Training Epoch: 7 [39424/45000]	Loss: 0.0736	LR: 0.020000
Training Epoch: 7 [39680/45000]	Loss: 0.1105	LR: 0.020000
Training Epoch: 7 [39936/45000]	Loss: 0.0863	LR: 0.020000
Training Epoch: 7 [40192/45000]	Loss: 0.0518	LR: 0.020000
Training Epoch: 7 [40448/45000]	Loss: 0.0405	LR: 0.020000
Training Epoch: 7 [40704/45000]	Loss: 0.1039	LR: 0.020000
Training Epoch: 7 [40960/45000]	Loss: 0.1004	LR: 0.020000
Training Epoch: 7 [41216/45000]	Loss: 0.1225	LR: 0.020000
Training Epoch: 7 [41472/45000]	Loss: 0.1399	LR: 0.020000
Training Epoch: 7 [41728/45000]	Loss: 0.0948	LR: 0.020000
Training Epoch: 7 [41984/45000]	Loss: 0.0623	LR: 0.020000
Training Epoch: 7 [42240/45000]	Loss: 0.1010	LR: 0.020000
Training Epoch: 7 [42496/45000]	Loss: 0.0668	LR: 0.020000
Training Epoch: 7 [42752/45000]	Loss: 0.1015	LR: 0.020000
Training Epoch: 7 [43008/45000]	Loss: 0.0848	LR: 0.020000
Training Epoch: 7 [43264/45000]	Loss: 0.1225	LR: 0.020000
Training Epoch: 7 [43520/45000]	Loss: 0.1127	LR: 0.020000
Training Epoch: 7 [43776/45000]	Loss: 0.0847	LR: 0.020000
Training Epoch: 7 [44032/45000]	Loss: 0.0744	LR: 0.020000
Training Epoch: 7 [44288/45000]	Loss: 0.0867	LR: 0.020000
Training Epoch: 7 [44544/45000]	Loss: 0.1145	LR: 0.020000
Training Epoch: 7 [44800/45000]	Loss: 0.1039	LR: 0.020000
Training Epoch: 7 [45000/45000]	Loss: 0.1099	LR: 0.020000
Epoch 7 - Average Train Loss: 0.0932, Train Accuracy: 0.9679
Epoch 7 training time consumed: 324.42s
Evaluating Network.....
Test set: Epoch: 7, Average loss: 0.0004, Accuracy: 0.9648, Time consumed:23.43s
Saving weights file to checkpoint/retrain/ViT/Thursday_17_July_2025_00h_03m_36s/ViT-Cifar10-seed1-ret100-7-best.pth
Training Epoch: 8 [256/45000]	Loss: 0.0874	LR: 0.020000
Training Epoch: 8 [512/45000]	Loss: 0.0792	LR: 0.020000
Training Epoch: 8 [768/45000]	Loss: 0.1021	LR: 0.020000
Training Epoch: 8 [1024/45000]	Loss: 0.0541	LR: 0.020000
Training Epoch: 8 [1280/45000]	Loss: 0.0635	LR: 0.020000
Training Epoch: 8 [1536/45000]	Loss: 0.0689	LR: 0.020000
Training Epoch: 8 [1792/45000]	Loss: 0.0706	LR: 0.020000
Training Epoch: 8 [2048/45000]	Loss: 0.0991	LR: 0.020000
Training Epoch: 8 [2304/45000]	Loss: 0.0496	LR: 0.020000
Training Epoch: 8 [2560/45000]	Loss: 0.0466	LR: 0.020000
Training Epoch: 8 [2816/45000]	Loss: 0.0776	LR: 0.020000
Training Epoch: 8 [3072/45000]	Loss: 0.0400	LR: 0.020000
Training Epoch: 8 [3328/45000]	Loss: 0.0444	LR: 0.020000
Training Epoch: 8 [3584/45000]	Loss: 0.0598	LR: 0.020000
Training Epoch: 8 [3840/45000]	Loss: 0.0543	LR: 0.020000
Training Epoch: 8 [4096/45000]	Loss: 0.0632	LR: 0.020000
Training Epoch: 8 [4352/45000]	Loss: 0.0742	LR: 0.020000
Training Epoch: 8 [4608/45000]	Loss: 0.0360	LR: 0.020000
Training Epoch: 8 [4864/45000]	Loss: 0.0381	LR: 0.020000
Training Epoch: 8 [5120/45000]	Loss: 0.0857	LR: 0.020000
Training Epoch: 8 [5376/45000]	Loss: 0.0625	LR: 0.020000
Training Epoch: 8 [5632/45000]	Loss: 0.0710	LR: 0.020000
Training Epoch: 8 [5888/45000]	Loss: 0.0491	LR: 0.020000
Training Epoch: 8 [6144/45000]	Loss: 0.1061	LR: 0.020000
Training Epoch: 8 [6400/45000]	Loss: 0.0518	LR: 0.020000
Training Epoch: 8 [6656/45000]	Loss: 0.0434	LR: 0.020000
Training Epoch: 8 [6912/45000]	Loss: 0.0223	LR: 0.020000
Training Epoch: 8 [7168/45000]	Loss: 0.0670	LR: 0.020000
Training Epoch: 8 [7424/45000]	Loss: 0.0762	LR: 0.020000
Training Epoch: 8 [7680/45000]	Loss: 0.0581	LR: 0.020000
Training Epoch: 8 [7936/45000]	Loss: 0.0522	LR: 0.020000
Training Epoch: 8 [8192/45000]	Loss: 0.0725	LR: 0.020000
Training Epoch: 8 [8448/45000]	Loss: 0.0528	LR: 0.020000
Training Epoch: 8 [8704/45000]	Loss: 0.0520	LR: 0.020000
Training Epoch: 8 [8960/45000]	Loss: 0.0903	LR: 0.020000
Training Epoch: 8 [9216/45000]	Loss: 0.0718	LR: 0.020000
Training Epoch: 8 [9472/45000]	Loss: 0.0596	LR: 0.020000
Training Epoch: 8 [9728/45000]	Loss: 0.0428	LR: 0.020000
Training Epoch: 8 [9984/45000]	Loss: 0.0430	LR: 0.020000
Training Epoch: 8 [10240/45000]	Loss: 0.0500	LR: 0.020000
Training Epoch: 8 [10496/45000]	Loss: 0.0625	LR: 0.020000
Training Epoch: 8 [10752/45000]	Loss: 0.0928	LR: 0.020000
Training Epoch: 8 [11008/45000]	Loss: 0.0614	LR: 0.020000
Training Epoch: 8 [11264/45000]	Loss: 0.0692	LR: 0.020000
Training Epoch: 8 [11520/45000]	Loss: 0.0809	LR: 0.020000
Training Epoch: 8 [11776/45000]	Loss: 0.0412	LR: 0.020000
Training Epoch: 8 [12032/45000]	Loss: 0.0929	LR: 0.020000
Training Epoch: 8 [12288/45000]	Loss: 0.0415	LR: 0.020000
Training Epoch: 8 [12544/45000]	Loss: 0.0320	LR: 0.020000
Training Epoch: 8 [12800/45000]	Loss: 0.0419	LR: 0.020000
Training Epoch: 8 [13056/45000]	Loss: 0.0682	LR: 0.020000
Training Epoch: 8 [13312/45000]	Loss: 0.0366	LR: 0.020000
Training Epoch: 8 [13568/45000]	Loss: 0.0552	LR: 0.020000
Training Epoch: 8 [13824/45000]	Loss: 0.0688	LR: 0.020000
Training Epoch: 8 [14080/45000]	Loss: 0.0619	LR: 0.020000
Training Epoch: 8 [14336/45000]	Loss: 0.0246	LR: 0.020000
Training Epoch: 8 [14592/45000]	Loss: 0.0490	LR: 0.020000
Training Epoch: 8 [14848/45000]	Loss: 0.1161	LR: 0.020000
Training Epoch: 8 [15104/45000]	Loss: 0.0525	LR: 0.020000
Training Epoch: 8 [15360/45000]	Loss: 0.0430	LR: 0.020000
Training Epoch: 8 [15616/45000]	Loss: 0.0657	LR: 0.020000
Training Epoch: 8 [15872/45000]	Loss: 0.0467	LR: 0.020000
Training Epoch: 8 [16128/45000]	Loss: 0.0579	LR: 0.020000
Training Epoch: 8 [16384/45000]	Loss: 0.0800	LR: 0.020000
Training Epoch: 8 [16640/45000]	Loss: 0.0550	LR: 0.020000
Training Epoch: 8 [16896/45000]	Loss: 0.0351	LR: 0.020000
Training Epoch: 8 [17152/45000]	Loss: 0.0310	LR: 0.020000
Training Epoch: 8 [17408/45000]	Loss: 0.0489	LR: 0.020000
Training Epoch: 8 [17664/45000]	Loss: 0.0775	LR: 0.020000
Training Epoch: 8 [17920/45000]	Loss: 0.0368	LR: 0.020000
Training Epoch: 8 [18176/45000]	Loss: 0.0590	LR: 0.020000
Training Epoch: 8 [18432/45000]	Loss: 0.0360	LR: 0.020000
Training Epoch: 8 [18688/45000]	Loss: 0.0473	LR: 0.020000
Training Epoch: 8 [18944/45000]	Loss: 0.0571	LR: 0.020000
Training Epoch: 8 [19200/45000]	Loss: 0.0738	LR: 0.020000
Training Epoch: 8 [19456/45000]	Loss: 0.0407	LR: 0.020000
Training Epoch: 8 [19712/45000]	Loss: 0.0533	LR: 0.020000
Training Epoch: 8 [19968/45000]	Loss: 0.0639	LR: 0.020000
Training Epoch: 8 [20224/45000]	Loss: 0.0668	LR: 0.020000
Training Epoch: 8 [20480/45000]	Loss: 0.0386	LR: 0.020000
Training Epoch: 8 [20736/45000]	Loss: 0.0628	LR: 0.020000
Training Epoch: 8 [20992/45000]	Loss: 0.0424	LR: 0.020000
Training Epoch: 8 [21248/45000]	Loss: 0.0760	LR: 0.020000
Training Epoch: 8 [21504/45000]	Loss: 0.0576	LR: 0.020000
Training Epoch: 8 [21760/45000]	Loss: 0.0736	LR: 0.020000
Training Epoch: 8 [22016/45000]	Loss: 0.0619	LR: 0.020000
Training Epoch: 8 [22272/45000]	Loss: 0.0602	LR: 0.020000
Training Epoch: 8 [22528/45000]	Loss: 0.0252	LR: 0.020000
Training Epoch: 8 [22784/45000]	Loss: 0.0654	LR: 0.020000
Training Epoch: 8 [23040/45000]	Loss: 0.0522	LR: 0.020000
Training Epoch: 8 [23296/45000]	Loss: 0.0434	LR: 0.020000
Training Epoch: 8 [23552/45000]	Loss: 0.0963	LR: 0.020000
Training Epoch: 8 [23808/45000]	Loss: 0.0882	LR: 0.020000
Training Epoch: 8 [24064/45000]	Loss: 0.0679	LR: 0.020000
Training Epoch: 8 [24320/45000]	Loss: 0.0821	LR: 0.020000
Training Epoch: 8 [24576/45000]	Loss: 0.0572	LR: 0.020000
Training Epoch: 8 [24832/45000]	Loss: 0.0580	LR: 0.020000
Training Epoch: 8 [25088/45000]	Loss: 0.0516	LR: 0.020000
Training Epoch: 8 [25344/45000]	Loss: 0.0645	LR: 0.020000
Training Epoch: 8 [25600/45000]	Loss: 0.0813	LR: 0.020000
Training Epoch: 8 [25856/45000]	Loss: 0.1115	LR: 0.020000
Training Epoch: 8 [26112/45000]	Loss: 0.0902	LR: 0.020000
Training Epoch: 8 [26368/45000]	Loss: 0.0716	LR: 0.020000
Training Epoch: 8 [26624/45000]	Loss: 0.0529	LR: 0.020000
Training Epoch: 8 [26880/45000]	Loss: 0.0557	LR: 0.020000
Training Epoch: 8 [27136/45000]	Loss: 0.0738	LR: 0.020000
Training Epoch: 8 [27392/45000]	Loss: 0.0590	LR: 0.020000
Training Epoch: 8 [27648/45000]	Loss: 0.0795	LR: 0.020000
Training Epoch: 8 [27904/45000]	Loss: 0.0539	LR: 0.020000
Training Epoch: 8 [28160/45000]	Loss: 0.0546	LR: 0.020000
Training Epoch: 8 [28416/45000]	Loss: 0.0500	LR: 0.020000
Training Epoch: 8 [28672/45000]	Loss: 0.0728	LR: 0.020000
Training Epoch: 8 [28928/45000]	Loss: 0.0728	LR: 0.020000
Training Epoch: 8 [29184/45000]	Loss: 0.0319	LR: 0.020000
Training Epoch: 8 [29440/45000]	Loss: 0.0556	LR: 0.020000
Training Epoch: 8 [29696/45000]	Loss: 0.1058	LR: 0.020000
Training Epoch: 8 [29952/45000]	Loss: 0.0611	LR: 0.020000
Training Epoch: 8 [30208/45000]	Loss: 0.0527	LR: 0.020000
Training Epoch: 8 [30464/45000]	Loss: 0.0803	LR: 0.020000
Training Epoch: 8 [30720/45000]	Loss: 0.0855	LR: 0.020000
Training Epoch: 8 [30976/45000]	Loss: 0.0424	LR: 0.020000
Training Epoch: 8 [31232/45000]	Loss: 0.0459	LR: 0.020000
Training Epoch: 8 [31488/45000]	Loss: 0.0603	LR: 0.020000
Training Epoch: 8 [31744/45000]	Loss: 0.1024	LR: 0.020000
Training Epoch: 8 [32000/45000]	Loss: 0.0810	LR: 0.020000
Training Epoch: 8 [32256/45000]	Loss: 0.0492	LR: 0.020000
Training Epoch: 8 [32512/45000]	Loss: 0.0746	LR: 0.020000
Training Epoch: 8 [32768/45000]	Loss: 0.0641	LR: 0.020000
Training Epoch: 8 [33024/45000]	Loss: 0.0548	LR: 0.020000
Training Epoch: 8 [33280/45000]	Loss: 0.0545	LR: 0.020000
Training Epoch: 8 [33536/45000]	Loss: 0.0859	LR: 0.020000
Training Epoch: 8 [33792/45000]	Loss: 0.0446	LR: 0.020000
Training Epoch: 8 [34048/45000]	Loss: 0.0625	LR: 0.020000
Training Epoch: 8 [34304/45000]	Loss: 0.0471	LR: 0.020000
Training Epoch: 8 [34560/45000]	Loss: 0.0676	LR: 0.020000
Training Epoch: 8 [34816/45000]	Loss: 0.0816	LR: 0.020000
Training Epoch: 8 [35072/45000]	Loss: 0.0792	LR: 0.020000
Training Epoch: 8 [35328/45000]	Loss: 0.0522	LR: 0.020000
Training Epoch: 8 [35584/45000]	Loss: 0.1046	LR: 0.020000
Training Epoch: 8 [35840/45000]	Loss: 0.0796	LR: 0.020000
Training Epoch: 8 [36096/45000]	Loss: 0.0422	LR: 0.020000
Training Epoch: 8 [36352/45000]	Loss: 0.0527	LR: 0.020000
Training Epoch: 8 [36608/45000]	Loss: 0.0807	LR: 0.020000
Training Epoch: 8 [36864/45000]	Loss: 0.0594	LR: 0.020000
Training Epoch: 8 [37120/45000]	Loss: 0.0652	LR: 0.020000
Training Epoch: 8 [37376/45000]	Loss: 0.0689	LR: 0.020000
Training Epoch: 8 [37632/45000]	Loss: 0.0783	LR: 0.020000
Training Epoch: 8 [37888/45000]	Loss: 0.0637	LR: 0.020000
Training Epoch: 8 [38144/45000]	Loss: 0.0602	LR: 0.020000
Training Epoch: 8 [38400/45000]	Loss: 0.0350	LR: 0.020000
Training Epoch: 8 [38656/45000]	Loss: 0.0598	LR: 0.020000
Training Epoch: 8 [38912/45000]	Loss: 0.0675	LR: 0.020000
Training Epoch: 8 [39168/45000]	Loss: 0.0929	LR: 0.020000
Training Epoch: 8 [39424/45000]	Loss: 0.0481	LR: 0.020000
Training Epoch: 8 [39680/45000]	Loss: 0.0212	LR: 0.020000
Training Epoch: 8 [39936/45000]	Loss: 0.0380	LR: 0.020000
Training Epoch: 8 [40192/45000]	Loss: 0.0363	LR: 0.020000
Training Epoch: 8 [40448/45000]	Loss: 0.0476	LR: 0.020000
Training Epoch: 8 [40704/45000]	Loss: 0.0368	LR: 0.020000
Training Epoch: 8 [40960/45000]	Loss: 0.0747	LR: 0.020000
Training Epoch: 8 [41216/45000]	Loss: 0.0629	LR: 0.020000
Training Epoch: 8 [41472/45000]	Loss: 0.0334	LR: 0.020000
Training Epoch: 8 [41728/45000]	Loss: 0.0758	LR: 0.020000
Training Epoch: 8 [41984/45000]	Loss: 0.0870	LR: 0.020000
Training Epoch: 8 [42240/45000]	Loss: 0.0398	LR: 0.020000
Training Epoch: 8 [42496/45000]	Loss: 0.0934	LR: 0.020000
Training Epoch: 8 [42752/45000]	Loss: 0.1067	LR: 0.020000
Training Epoch: 8 [43008/45000]	Loss: 0.0594	LR: 0.020000
Training Epoch: 8 [43264/45000]	Loss: 0.0569	LR: 0.020000
Training Epoch: 8 [43520/45000]	Loss: 0.0815	LR: 0.020000
Training Epoch: 8 [43776/45000]	Loss: 0.0976	LR: 0.020000
Training Epoch: 8 [44032/45000]	Loss: 0.0518	LR: 0.020000
Training Epoch: 8 [44288/45000]	Loss: 0.0664	LR: 0.020000
Training Epoch: 8 [44544/45000]	Loss: 0.0586	LR: 0.020000
Training Epoch: 8 [44800/45000]	Loss: 0.0666	LR: 0.020000
Training Epoch: 8 [45000/45000]	Loss: 0.0677	LR: 0.020000
Epoch 8 - Average Train Loss: 0.0623, Train Accuracy: 0.9778
Epoch 8 training time consumed: 324.15s
Evaluating Network.....
Test set: Epoch: 8, Average loss: 0.0004, Accuracy: 0.9662, Time consumed:23.42s
Saving weights file to checkpoint/retrain/ViT/Thursday_17_July_2025_00h_03m_36s/ViT-Cifar10-seed1-ret100-8-best.pth
Valid (Test) Dl:  10000
Train Dl:  50000
Retain Train Dl:  45000
Forget Train Dl:  5000
Retain Valid Dl:  45000
Forget Valid Dl:  5000
retain_prob Distribution: 10000 samples
test_prob Distribution: 10000 samples
forget_prob Distribution: 5000 samples
Set1 Distribution: 5000 samples
Set2 Distribution: 5000 samples
Set1 Distribution: 5000 samples
Set2 Distribution: 5000 samples
Set1 Distribution: 10000 samples
Set2 Distribution: 10000 samples
Set1 Distribution: 10000 samples
Set2 Distribution: 10000 samples
Test Accuracy: 96.69921875
Retain Accuracy: 98.01207733154297
Zero-Retain Forget (ZRF): 0.7435938119888306
Membership Inference Attack (MIA): 0.7926
Forget vs Retain Membership Inference Attack (MIA): 0.537
Forget vs Test Membership Inference Attack (MIA): 0.5215
Test vs Retain Membership Inference Attack (MIA): 0.512
Train vs Test Membership Inference Attack (MIA): 0.5135
Forget Set Accuracy (Df): 95.79044342041016
Method Execution Time: 5174.59 seconds
